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ABSTRACT 





This paper describes a proposal for a revised Ccmfiler 
Monitor System If (CMS-2). Primary emphasis is placed on 
design improvements to the CMS-2 compiler and language. 
Changes tc the Moniter and Librarian which support the above 
improvements are discussed or implied where appropriate. A 
new concept is proposed, called multi-level frogramnming, 
which allows the system designer to define the levels of 
language constLructS which are appropriate for the varicus 
types of program modujies ina large self-contained software 
system. fhe approach taken is to design a language and 
compiler-ponitor system (CMS-22S) which will facilitate the 
multi-level programming concept and the top-down programming 
method of software engineering in a producticn library 
environment, 
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I. INTRODUCTION 


The origins and design capabilities of the CHS-2 System 
must be explored before proposing any steps toward a revised 
language and compiler. This section describes the history 
of CMS-2 and the goals and objectives of this paper. The 
CMS-Z System and language are briefly described in Section 
II. Section III describes the revised language and compiler 
which will be called CMS-2RS. In Section IV, a detailed 
discussion of SLKE({1) grammars, parsing algorithms, and 
CMS-2RS language analysis 1S provided. Section V describes 
the first pass of the CMS-2RS compiler and offers recommen- 
dations for intermediate language and second.pass deSigns. 


Finally, the thesis conclusions are presented in Section VI. 


Ae BACKGROUN 

Compiler Monitor System II (CMS-2) is an integrated 
group of computer program modules which comprise a Monitor 
System (MS-2) and a Compiler System (CMS-2). The Monitor 
System supervises the compiling, library, and loading 
processes. The Compiler System translates CMS-2 language 
source code into object code for any one of five target con- 
puter systems. The following sections provide a historical 
background of the CMS-2 System and language. 

1. History of Compiler Monitor System II (CMS-2) 

The primary reason for developing the CMS-2 Systen 
resulted in large part from the decision to develop the Navy 
AN/UYK-7 third generation computer. The two Systems devel- 
oped were designed to be hosted in the UNIVAC CP-642B and 
AN/UYK-7 computers. The CP-642B version generates object 
code for those two computers, as well as the Litton L-304, 
UNIVAC-1830A, and UNIVAC-1218/1219 computers. 

The CMS-2 System was built by Computer Sciences Corp~ 
Oration under contract to Fleet Combat Direction Systems 


Smepolrt Activity, Pacific ({FCDSSAPAC), in San Diego, Cali- 
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fornia. FCDSSAPAC provides System production, maintenance, 
and support services for all Navy and contractor activities 
involved in Command and Control Systems software development. 
| 2. History of CMS-2 Language 
The CMS-2 language extends the capabilities of 
Compiler System-1 language (CS-1) and includes scme of the 
features of FORTRAN, JOVIAL, and PL/I. CNHS-2 language 
capabilities were implemented specifically for Copmand 
and Ccntrol Communications problems which include internal 
and external message processing, tabie update and search, 
coordinate conversion, transcendental and hyperkclic func- 
taicns and data display processing requirements. The language 
Was initially implemented in 1969 and is now the Navy standard 
for ali Command and Controi Systems applications. 
B. GOALS AND OBJECTIVES 
This section describes the thesis objectives and the 
preject goals to implement and validate those obiectives. 
1. Thesis Objectives 
The CMS~-2 System and Language were specifically 
developed to provide a high level language processor system 
for building and maintaining deliverable computer programs 
from a production library. In its current form, the CMS5-2 
System and language are in large part capable of accomplish- 
ing that requirement, but not without some inefficiencies 
and limitations in the System, as well as redundancies, 
ambiguities,. and limitations in the language. It is with 
these deficiencies in mind that the following objectives 
are defined. 
ae Determine what ambiguities, redundancies, and 
linitations exist in the CNS-2 program, data, and control 
structures, and define the necessary structure changes to 
correct then. 
b. Develop an unambiguous Backus-Naur Form (BNF) 
description of the proposed language, as an SLR(1) Context 
Free grammar, that iS appropriate to automated compiler 


construction. 





Cc. Determine 1f an automatic SLR(1) granpar 


analysis and parsing tabie generation system can Le applied 


to the proposed language and system to sinplify compiler 
design and maintenance. | 

d. Determine what fundamental changes need to be 
made to the language and system to support the structured 
programming concept in a modular library envircnment. 

e. Determine 1f£ the CMS~2 language can ke seg- 
mented to allow design and implementation of a modular 
compiler, thus allowing ease of tuture language extension 
and pbalntenance. 

Ze Project Goals 

a. Produce an operational first pass lexical 

Scanner and syntax analyzer that will isplement and vali- 


date the above thesis objectives. 


b. DeSign an intermediate language (IL) that can be 


used ky the second pass of the compiler to generate optimized 


Object code for either a singie or muitipie address target 


computer. 
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Precio oF STEM VANDEL ANGUINGE IDES IGN 


This section describes the CMS-2 System including the 
Monitor, Compiler, Librarian, Loader, Assembler, Flowcharter 
and Tape Utility. Additionally, program, header, data and 
control structures of the CMS-2 language are described. 


A. CMSazZ sit TEN DESIGN 

The CMS-2 System consists of the MS~-2 Monitor and sub- 
ordinate systems wnose Operations are controlled by the 
mcnitor program. This section describes the CMS-2 Systen 
as presented by references 1, 2, and 3. The CMS-2 Systems 
include the following. 

ieee lies LloOnttoresysten 

The WS-2 Monitor System iS a batch-procesSing oper- 

ating system designed to control execution of its subsid- 
lary components and user jobs being run on the computer. 
The fBOnitor provides the environmental interfaces necessary 
for all programs running under itS control. These inter- 
faces include a job control card processor, an interrupt 
processor, an input/output system, an operator ccrmunication 
module, a debug module with dump, patch, and snap capabilities, 
and a job accounting package. In additicn, the mpcnitor 
Maintains a library of the system component programs and 
data base definitions that can be called by the user 
upon request to be added to hisS compile or execution package. 
The CMS-2 job flow is shown in Figure 1. 

22) Cito= 2" Coupirter System 

The CP-642B version Compiler 1S a two-phase langauge 

processor that analyzes a dual syntax source program (CS-1 
and CMS-2 languages) and generates object code for any one 
of the computers mentioned above. The phases cf compiler 


translaticn are described below and illustrated in Figure 2. 
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a. First Phase (Syntax Analyzer) 

The first phase of the compiler processes the 
user's source program which consists of CMS-2 and CS-1 
language statements and properly bracketed machire code 
instructions. The source statements are checked for 
validity, and an internediate form of the prograg (IL) and 
symkcl table are generated. 

be. Second Phase (Code Generator) 

The sécond phase processes the IL and symbol 
table tc produce the final output listings and object code 
for the target computer. Each code generator preduces the 
appropriate object code for each target computer in either 
aksclute or relocatable focmat. The code may reference in- 
put, output, debug, and built-in functional routines. These 
routines are then linked to the object code of the calling 
program either by the compiler in an absolute mode or by 
the linking-loader in a relocatable mode. The built-in 
functional routines are either added in-line to the object 
code where referenced or are added as procedures to be linked 
together at run-time. 

Syeeetio—-Z2) Ebuboari an 
The librarian is a file management systen 
that provides storage, retrieval, and correction functions 
for a programmer's Source programs and object code. 
ae Library Maintenance 

The library Maintenance or executive ccntrol 
routine (LIBEXEC) is used to create, modify, or reproduce 
libraries for CMS-2 programmers. They contain source pro- 
grams, object modules, and predefined data designs. A library 
translator routine, under control of LIEEXEC, is used to 
convert existing CS-1 programs or libraries into a CMS-2 
library format. 

the Library Sjearch 
The library search routine ig responsible for 


retrieving data from a previously created CMS-2 library. 
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4. ‘Cbject Code Leaders 
The CMS-2 System includes two loader programs for 


linking and loading object code produced by the CMS-2 
compiler. The absolute leader loads all instructions and 
data at the addresses assigned during the compilaticn. 
Siwilarly, the relocatable loader processes relocatable 
object code directly from the compiler cr from a CHS<2 
library ky assigning all program memory addresses and 
linking program segments tc froduce an executable object 
program. The loader, in the case of the AN/UYK-7, combines 
independently compiled program Segments under a common 
base register or registers. 
ae Lepe 6 Util aiy 
The CMS-2 system provides a set of utility routines 
to assist the programmer with the manipulation of data 
recorded on magnetic tape. The routines provide the capa- 
bility to construct, duplicate, compare, list, and 
reformat data files on tape. 
6. CMS<-2 Flcwcharter 
The flowcharter is designed to process unique 
ficwcharter statements ina user's CMS-2 source program 
and output a flowchart of the program logic to the 
high-speed printer. 
7. Assembler 
The asseubler, in the case of the AN/UYK-7 version, 
accepts mnemonic oriented instructions and provides a macro 


instructicn capability. 


Bo. CMS-2 LANGUAGE DESIGN 

This section preSents an overview of the major 
features of CMS-2, and its program, hneader, data, and 
control structures, as presented by references 1, 2, and 3. 

1. Major Features 

The major features of CMS-2 are: modular frocedure- 

Oriented program structures; high-level control structures; 
reentrant procedures; dynamically allocated data designs; 


Separate definition of data and control structures; state- 
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ment oriented card input processing; Fixed-point, Floating- 
point, Bcolean, Hollerith, and Status data types; and Char- 
acter and Bitstring manipuiation. 

Z. Program Structure 

A CMS-2 Program is composed of ordered sets of state- 
ments organized into header, data, and control structures. 
These structures can be grouped to form System Elements called 
System Data Designs, System Procedures, or System Keentrant 
Frocedutres. Any System Flement or Elements may Eke separately 
compiled as a System, which may be a complete-or partial 
execution package, as shown in Table I. 

The System Procedures and Reentrant Procedures are 
composed of ordered sets of Local and Auto Data DeSigns and 
Function and Procedure subroutines aS shown in Table II. 
The local procedure within a System Procedure which shares 
its name is known as the Prime Procedure and is the entry 
point for System Procedure calls. A System Reentrant Pro- 
cedure may only contain Auto Data Designs and its lccal 
procedures and functions are composed of reentrant state- 
ments, Each procedure that calls a Reentrant Precedure 
is dynamicaliy assigned a private copy of the Auto Data 
Design during execution. 

The headers are groups of statements that precede 
System Elements and specify control of compiler and loader 
processing of source and object code. 

ae Progran Statement Design 

Program statements are composed of various sets 
of symbols and delimiters which are further compesed of 
elements from the revised U. S. ASCII Standard Character 
Set. The symcols are catagorized as oferators, ccntrol 
identifiers, data identifiers, and constants. 

Operators are described by special characters 
and reserved words, and are used for unary, binary, and 
functional operations. They may operate on defined data 
fields and are used for control of program execution. Con- 
trol identifiers are described by user declared werds which 


define program lccations where program execution flew may 


16 





SYSTEM A 


a eee ee ee 


HEADER 
DECLARATICN 


SYSTEX 


LABEL ES i = 


SYSTEM B 


SYSTEM 
DATA LESIG 


EXTERNAL 
REFERENCE 


SYSTEM 
PROCEDURE 


DLO TEE 
PROCEDURE 


SY o REM 
DATA DESIGN 


SYSTEM 
PROCEDURE 


gi 


SYSTEM C 


Sou EM 
PROCEDURE 


oS) a 
DATA DESIGN 


es 


Sh Seay 
REENTRANT 
PROCEDUKFE 


SYST EM 
DATA DESIGN 


EATERNAL 
REFERENCE 


So leon 
PROCEDUKE 


SAMPLE PROGRAM SYSTEMS 





SHRM LEN SRROCEDURE A 


a ee ee a ee ee ee eee ee ee ee ee 


LOCAL DATA CESIGN 


PROCEDURE A 


PROCEDURE E 


LOCAL DATA DESIGN 


FROCEDURE C 


AUTO DATA LESIGN 


PROCELURE LC 


LOCAL DATA DESIGN 


FUNCTION E 


PROCEDURE F 


OYSTEM REENTRANT PROCEDURE B 


= ee ee eee ee ee eee tn 


AUTO DATA DESIGN 


PROCEDURE B 


AUTO DATA DESIGN 


FUNCTION A 


PROCEDURE C 


AUTO DATA DESIGN 


FUNCTION D 


PROCEDURE & 


PROCEDURE F 


TABLE ITI — SAMPLE SYSTEM PROCEDURES 


18 


ep ee ee ee ee eee ee 





be transferred. These locations may be specified by state- 
ment labels or procedure and function names. 

Data identifiers define simple variables, 
statement Switches, tables, and arrays. Constants are 
ordered sets ot numeric characters with known, fixed values. 
They may be reals (decimal or octal), Hcllerith strings, 
Status values, or Boolean. Another type o£ constant, calied 
called a Tag, is represented by sets of alphanumeric char- 
acters and can be used throughout a compile-tite System in 
place of literal constants. 

b. Scope of Identifiers 

The scope of an identifier is the range of pro- 
gram structure within which an identifier can Le referenced 
or has meaning. 

(1) Forward and Backward References. Within a 
Data Design, a data structure may be referenced either 


before (forward) or after (backward) it has been defined. 


fru 


Outside a Data Design, a data structure may only te reference 
after it has been defined. 
A reference to statement labels kithin System 
Procedures; calis to local procedures and functions within 
their System Procedure; and calls to Prime Procedures not 
having fcrmal parameters or abnormal exits may be either 
forward or backward. 
(2) Local and Global Definitions. Local 
definitions are those identifiers which can be referenced 
only from within the System Element in which they are 
defined. Global definitions apply to those identifiers 
that can be referenced both from inside and outside the 
System Element in which they are defined. 
Local Data DeSigns are local cnly to the 
System Procedure in which they are declared. All System 
Data Designs and Prime Procedures are glicbal to their 
compile-tite System. 
(3) External References and External Definitions. 
An external definition (EXTDEF) is a prefix that allows an 


identifier to be referenced outside of the System Element 
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where it is defined. All System Data Designs and Prime 
Procedures are automatically defined as external. An external 
reference (EXTREF) is a prefix that allows an identifier to 

be referenced in a System Element that is local tc another 
System Element. These external references must okey the 
local/global ranges of the identifiers they reference. 

The following cases apply. 

(a) The identifer is a data structure in 
a Local or System Data Design that follcws the reference. 

(b) The identifier is a local procedure 
Name, Statement label, or data structure in a Lccal Data 
Design, and is defined in another System Procedute. 

(c) The identifier is defined in another 
compile-time System that will be linked with the current 
System at load time. This case is identified by the prefix 
TRANSREF in the AN/UYK-7 Systen. 

(4) Declarative Delimiters. Table IiIf shows 
the deliniters which declare the beginning and ending cf the 
Varicus frogram structures. 

3. Header Structure 
The header structure contains compiler control 
declaratives which specify action as to base recister 
allccaticn modes, library retrieval and update options, 
program debug features, object code addressing, cutfput 
format and listing options, system index-register inter- 
pretation, and the computer mode of operation within 
which tke designated program is expected to run. These 
statements may be located in major headers, if the contrecl 
applies to the entire compile-time System, and in minor 
headers, if the control applies only to a System Element. 
4. Data Structure 
The data structure consists of precise definitions 
of té€mporary and permanent data storage areas, input areas, 
cutfut areas, and special data units such aS progran 
SWitches. These structures can be grouped together to forn 
System, Local, and Auto Data Designs. A data declaration 


defines the type, precision, identifier, and ofticnally, the 
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scaling and preset value of a data element in a Data Design. 
The five major data structures are switches, variables, 
Tables, Arrays and Files. An Overlay declaration ts used 
to define an identifier which is packed with the values of a 
list of variables or Fields in the order in which they are 
listed from left-to-right. 
a. Swatches and Variables 

Swatches provide for transfer of program control 
to statements or procedures depending upon the value of a 
programmer supplied index. A variable is a Singular piece 
of data of length one bit, multiple bits, or computer words. 
If the length is not specified then a default parameter is 
assumed for the specified target computer. Variakles may 
also be preset to a compatible value. 

b. Tables 

Tables hold crdered sets of identically struc- 
tured information. The common unit of data structure ina 
iago.we 25 the Tten. . 

(1) Item, An Item consists of k ccmputer 
words, where k is selected by the programmer or compiler. 
A Takle may contain n Items, where n is a declared 
parameter. Tables may be declared horizontally sucn that 
all words 0 of all Items are stored together sequentially 
follcwed by all words 1 of all Items uf to word n. The 
words of a vertically defined Table are stored such that all 
words. of every Iten are stored together seguentialiy. Item 
aSSignments are shown in Figure 3. 

(2) Fields. Items may be subdivided into 
Fields, which are the smallest subdivision of a Table. A 
Field may be a partial werd, full word, or multiwcerd sub- 
divisicn. Fields have the same data types as variables and 
may be preset and altered dynamically as variables. In 
addition, Fields may overlap each other, but must be type 
compatible. Field assignments are shown in Figure 4. 

(SypeesUbtLables;sbuke-tabtes, anda. vel-alease 
fhe Table declaration structure also allows the programmer 


to define a subset of adjacent Items within a Takkle as a 
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FIGURE 3 — TABLE ITEM AND ARRAY ASSIGNMENTS 


(Figure 3 was extracted from Ref. 1) 
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; FIGURE 4 — TABLE FIELD ASSIGNMENTS 


(Figure 4 was extracted from Ref. 1) 
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Suktakle. The programmer may also allocate a working stor- 
age area outside the table, called an Item-area, which 
autcmatically takes on the same Field format as the Table 
Item. Additionally, the programmer may declare Tables 
having the identical Field format as the parent Table, 
cailed Like-Tables, but having a different number of Items. 
The Table structures are shown in Figure 5. 
c. Arrays 
An array is a mnulti-dimensicnal extensicn ot the 
Table ccncept for storing ordered sets of identically struc- 
tured information previously defined as Items. The array 
structures are shown in Figure 3. 
d. Files | 
A File declaration defines a data structure environ- 
ment in which one or more physical files are to ke processed. 
The declaraticn asSigns a File name for dynamic statement 
rerereacing, identifies the symbolic name assigned to the 
actual hardware device, and declares that all data to be pro- 
cessed on the named hardware device 1s physically organized as 
described in the declarative statement. 
5. Control Structure 
The CMS-2 control structure or dynamic statement 
specifies processing operaticns and results in executable code. 
Fach statement consists of an operator followed Ey a list 
of cperands and additional operators. An operand may be a 
single name, a constant, a data unit reference, or an 
expression. The data units may be variables, subscripted 
Variakles, Tables, or Files. 
a. Statement and Functional Operators 
The major statement operators are described in 
Table IV. The functional operators in CMS=2 are used to 
factlitate references to and operations on data structures. 
These operators are described in Table V. 
bk. Expressions 
Real expressions may include standard 
addition, subtraction, multiplication, and divisicn 


Operators, as well as exponentiation, mixed-mode values, 
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FIGURE 5 — TABLE STRUCTURES 


(Figure 5 was extracted from Ref. 1) 
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OFERAIOR 
SET 


SWAF 
GOTO 


Tee 


VARY 


FIND 


PACK 
SHIFT 


RESUME 


RETURN 


EXEC 


STOF 


HEANING 

Performs claculatiocns or assigns a value to one or 
more data units. . The assignemnt may be Real, 
Hollerith, Status, Boolean or Multi-word. 


Exchanges the contents of two data units. 


Alters program flow directly or via a statement 
Switch. 


Expresses a Bcolean test Situaticn for conditional 
execution cr one more statements. 


Establishes a program loop to repeat execution of 
one or more statements by varying one or mcre 
imcexes@ryva’ specified increment until a test 
value is satistfied. 


Searches a table for data that satisfies specified 
conditions, and assigns to subscript variafles the 
andex values pointing to its location. 


Transfers bits strings into a data area. 


Shifts a string of bits right or left a specified 
amcunt, 


Specifies a transfer of execution ccntrol to the 
increment and test steps within a VARY block, 


A transfer of executicn control frem a procedure 
call that specifies normal return to a label if a 
Special condition is met, or a return of the value 
or an expression to the point of call. 


A call to an executive program which passes one OL 
twce parameters indicating what action to be taken 
and on what data unit or address. 


Temporarily sespengs program execution until 
manually restarted cn the computer. 


TABLE IV + CONTROL STATEMENT OPERATORS 
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= ae eee 


LENCTE 


CNT 
CAT 


lo 
bri 


tn fret 
A 


ATOR 
ELAY 


D 
SNAE 
RANGE 


TRACE 


PTRACE 


a ae ee oe 


MEANING 
To reference a string of bits in a data unit. 


To reference a string cf characters in a hollerith 
data unit. 


To reference a data unit's relative core address. 
To obtain absolute value of an arithmetic primary. 


TO move a magnetic tape record, file a specified 
number of positions forward or backwards. 


To obtain a record's length for the last input- 
output operation. 


TOmObDtainececunt of bits Set Ho One inva data. init. 


To concatenate character strings. 
TABLE V - FUNCTICNAL OPERATCES 


MEANING 

Causes the contents of machine De da eoes and 
Specified data units to be formatted and printed 
on the system Output device. 


Contents of a data unit are printed and stored. 
Subsequent executions cause a printcut only when 
the data contents are modified. 


ey and low values are specified for a data unit, 
and e€ach time the data unit is modified in the 
program, a message iS printed if the value falls 
Outside the range. 


A printout is generated for the execution cf each 
CMS-2 statement between TRACE and END-TRACE. 


Each CMS-2 procedure call encountered in the. 
pregram being executed is identified by calling 
and called procedure names. 


TABLE Viit - FROGRAM CLEEUG STATEMENT OPERATORS 
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and in~line redefinition of the scaling of fixed-point 
humbers. | 

; A relational expression performs a comparison 
between two Similar operands as specified by a relational 
Cperator. There are four types of comparisons: 

(1) Real, involving the ccmparison cf signed 
Real values (fixed, floating, or mixed), 

(2) Hollerith, involving a left-to-right, 
character-by-character ccmparison, 

(3) Boolean, involving the comparison of 
Single bits, and 

(4) Status, involving the comparison of status 
values. 

A CMS-2 expression may include algetraic, 
relational, and Boolean operators. The heirarchy of 
expressional evaluation is shown in Table VI. 

ce. Input-output Statements 

The CMS-2 Input-output statements permit the 
program to communicate with the various hardware devices 
while running in a monitor environment. The operators are 
described in Table VII. 

d. Program Debug Statements 

The CMS-2 Dekug Statements are placed in the source 
language to cause run-time debug routines to be availiable for 
program execution analysis. The debug statement cperators are 
descrikted in Table VIII. 
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TABLE VI 


ook = 


tA 


Ci LEC 
GTEQ, LT 


COMP 


AND 


OR 


XOR 


DEFINITION 
EXFONENTIATION, UNARY MINUS 


MULTIPLICATION, DIVISION 
ADDITION, SUBTRACTION 


EQUAL, NOT EQUAL 
GREATER THAN, LESS THAN OR EQUAL 
GREATER THAN OK EQUAL, LESS THAN 


LOGICAL COMPLEMENT 
LOGICAL HULTIPLICATION 
LOGICAL SUM 


LOGICAL DIFFERENCE 


- CMS~-2 MIXED EXPRESSION EVALUATION 
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ee epee py 


CJL a 


INPUT 


OUTEUT 


FORMAT 


ENCCDE 


DECCDE 


ENDEILE 


CHECKID 


SEAKCH 


MEANING 


Defines the environment and pertinent information 
concerning an input or output operation, and 
reserves a puffer area for record transmissicn. 


Prepares an externai device for infut, output, or 
scratch (both) operations. 


Deactivates a specified file and its external 
device, sends last unfinished buffer to outfut. 


Directs an input operation from an external device 
to a FILE bufrer area. 


DirectS an output oferation from a FILE buffer 
area to an externai device. 


LDefines the desired conversicn between 
data blocks and internal data definitions. 


Directs transformation of data elements into a 
common arrea, With colversion in accordance with a 
specified FORMAT. 


Directs unpacking of a common area and trans#ittal 
to data units as specified py a FORMAT 
declaration. 


Places and end-of-file mark on appropriate 
Tecording mediums. 


Directs checking an id header or label on a FILE 
if the device 15 at load point. 


Directs a search operation for specific data 
within a FILE. 


TABLE VII —- INPUT-OUTPUT STATEMENT OPERATORS 
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Tif. REVISED COMPILER AND LANGUAGE DESIGN 


This section describes the CHS-2RS Ccmpiler System and 
Language design capabilities which differ from the CMS-2 
System. A more complete description of the CMS-25S language 


is provided in Appendix B. 


A. EFROGKAM STRUCTURE 

In a large system of programming modules, the problems of 
isolation of logic design error and verification cf rrogran 
correctness tend to increase exponentially as the number of 
combinations oi program procedure call parameters and trans- 
fers of execution control increase. To counter this problen, 
a well-defined metnod of software engineering and Suitable 
language-compiler system are needed. 

1. Structured Programming 

The term "Structured Programming" is defined in ref- 
erences 4 and 5 as a top-down method of program building with 
the top program modules being at the highest level of the 
program logic design and referencing the next level modules 
as procedure calls in its control structures. This process 
continues until the modules at the lowest level are refer- 
encing baSic machine and operating system constrvtcts. 

. In reference 6, Bohm and others have shows tnat the 
control logic of any programming problem can be represented 
by the three basic control structures: simple statement 
Seguences, IF THEN ELSE statements, and DO WHILE statements. 
The CASE structure of ALGOL W may also be required to prevent 
nhultiple nesting of IF THEN ELSE statements. 

To ensure program reliability under varying loading 
conditions, its logical correctness must be easily verifiable. 
The first requirement must therefore be to restrict each 
Program segment to exactiy one entry and exit pcint. Another 
reguirement is that library substitution facilities be 


provided at compile and load-time so that the segemnts can 
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be stored and retrieved by symbolic names. Finally, scope 
of identifiers and GOTO statements must be contrciled in their 
application to prevent unlimited access to data and control 
structures. 
zs Program Structure Revisions 
The program structure of CMS-~2RS retains the modular 
structure of CMS-2, but includes several revisions to its 
conventions for procedure exit, and identifier referencing 
to facilitate igplementation of the tcep-down programming 
concept. 
a. Procedure Exit 
The convention of allcwing unrestricted abnormai 
exits or returns to a calling procedure has been modified. 
A return which precedes the point of call may result in 
an infinite loop condition, given the abnormal exit condi- 
tion still exists. Abnormal exits have therefore been 
restricted to return points which follow the point of call 
and lie within the calling procedure. 
b. External Referencing and External Definition 
The concept of externally defining an identifier 
in cne element to be externally referenced by another ele- 
ment destroys the integrity of the System Procedure and 
its Local Data Designs. A program logic error related to the 
above concept may be very difficult and costly tc locate. 
Theretole, external references are only allowed ketween 
different compile-time Systems and then only by specifying 
the identifiers of System Data Designs; System Frocedures 
or Reentrant Frocedures. The prefix EXTREF now arflies 
to System Data Designs in symbol table format and to System 
Procedures in source code format, and at load-time to System 
Procedures in object code format. The Elements thus speci-~ 
fied are then added as complete sections to the compile or 
load-time System aS appropriate. 
If it is necessary to reference another System 
Procedure's Local D&éta Design structures at compfile+time, 
then the data structure must te removed and put into an 


apprcpriate System Data Design so that it is glotal to 


Be. 


both procedures. The external definiticn, EXTDEF, and 
transient reference, TRANSREF, prefixes have been eliminated 
to simplify language and compiler design. 

Cc. Forward and Backward References 

The forward reference of identifiers tithin 

a data design is not necessary and has therefore keen 
deleted. The Tag declaration, which is used tc name a 
constant throughout compile-time, is sufficient to allow 
presetting of ali data structures within a data Gésign. 

d. Local and Giobal Definitions | 

The local and global definitions of identifiers 
allow program structure to develop in a logical fashion. The 
System Procedures at the lowest physical program level have 
access to their Local Data Designs and all other ;reviously 
deciared System Data Designs. The System Procedures at the 
highest level in the program, however, have access cnly toc 
their Lccal Data Designs and all System Data Designs declared 
above. This concept cf scope of identifiers prcvides for 
icplementation of the structured programming concept by 
allowing the top procedure to be written at the highest lcgic 
level and the bottom at the lowest level as Shown in Figure 6. 
What is required though, is a means of frescribing what levels 
of ccntrol structures (how close to the machine) ¢hculd be 
allcwed within each System Procedure in an e€xecutable Systen. 
3. Multi-level Programming 

A Multi-tevel programming system is one in which all 
the language constructs are expressed on a high iéxical 
level. They are, however, capable of implementing systems 
as well as applications programs. If these constructs are 
to ke ccntrolied in their use in all types of programming, a 
matrix of levels of programming verses language ccnstLructs is 
required in the syntax analyzer to allow only the appropriate 
code to be generated for each programming level. The Syntax 
analyzer can thereby ccntrol which grammar reductions are 
allcwed at each programming level in a System. A Lurther 
requirement exists to define in the Monitor System a matrix 


specifying which levels are accessible by each user, module 
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and corresponding project. The resulting systes allows a 
programmer to specify that other levels cf program modules, 
for which he is not authorized to generate code, are to be 
added at the appropriate level to his cconpile~time or load- 
time System. The concept of System executive cr superviscr 
state and task state are thereby expanded to levels of states. 
For example, the use of System Registers as variaklies in 
expressicns and assignment statements is limited tc those 
levels which reguire register access such as I/C routines. 
Thus the Monitcr System has ccmpiete ccntrol over which user 
generates what level of code for which System and provides 
a means of security over use of progran ccntrol ccnstructs. 
This ccncept is iliustrated in Figure 7. 
YW. Declarative Deiimiters 
Table IX shows the revised delimiters which declare 


the Eeginning and ending cf [rogram structures. 


Er HEADER STRUCTURE 
The minor header deciaration has been eliminated and 
Majcr headers now only apply to the entire compile or load- 
time System. The header declarations were not ispiemented 
in the current CMS-2kS compiler, but could easily be included 
in the same manner as the data declarations. The following 
CMS-2 statements have not been implemented in the header 
structure in order to Simplify the CMS-25FS language and 
compiler. | 
1. EQUALS and NITEMS Statement 
The Equals statement is used to assign numeric values 
to variabies and to specify their allccation relative tc 
cther variabies. 
a. Numeric Assignment 
The assignment of numeric values to symbols by 
Slmple farenthesis-free expressions requires the céeneraticn 
cf compile-time ccde to preset symbols to the desired values. 
This practice is not necessary since symbols can Le initial- 
ized in the data design by presets or in the prcecedure itself 


by dynanzic code. Allowing this method of pre-initialization 
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BEGIN DELIMITER 
oe a 
SYSTEOR 


SYS_DD 
SYS_PROC 
SYS_PROC_R 
PROCEDURE 
FUNCTION 
LOC_DD 
AUTC_DD 


HEAD 


TABLE IX 


END DELIMITER 
END 


END_DD 
END 


END 


END_DD 
END_DD 


END_HEAD 


Compile-time Systen 

systen Data Design 

System Procedure 

System Reentrant Procedure 
Local Procedure 

Local Function 

Local Data Design 

Auto Data Design 


Header Declaration 
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intreduces a possible source cf error that may be difficult 
to detect and violates the structured programming ccncept of 
Frogitam Segment integrity. . Further, initializing symbols at 
compile-time requires that header declaration infcrmation 
always be a part of the System Procedure or Data Lesign in 
order to maintain library configuraticn control. 

b. Relative Ailccation 

The rejative allocation feature allows a data unit 
to be assigned the same relative locaticn as ancther data 
unit. This relative allocation capability can also be 
accomplished at run-time by tne CORAD functional cferator, 
thus preserving library segment independence. 

The NITEMS declaration specifies that an identifier 
is tc be assigned a value fcr compile-time purposes, but may 
also be assigned ancther value at load-time. It is used only 
for establishing the number of items in a vertical Table whose 
size is to be determined at joad-time. 

The EQUALS and NITEMS statements were nct inplemented 
Since mcst of their capability could be more easily provided 
by the Tag declaration, 

ie CSWITCH Statement 

The CSWITCH (compile switch) statement prevides se- 
lective compijation of biocks of statements within a System 
Eiement. The CSWITCH parameter identifies a grcup of state- 
ments within a CSWITCH block. In the header, the list of 
parameters rollowing the CSWITCH identifier specifies to the 
compiler those CSWITCH blocks which are to be compiled in the 
System Element or Elements that follow. The parameter 
DELETE specifies that all CSWITCH code biocks in the follow- 
ing Element or Elements, whith are not keing ccmfiled, are 
to ke deleted from the scurce output and listings of the 
compiler. 

The CSWITCH feature was not implemented since the 
abiiity to selectively delete code frem conmpilaticr complicates 
the grebplem of program segment configuration controi and in 


some cases duplicates the library CORRECT? feature. 
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3. DEP.And SEXECUTIVE Statements 

The DEP statement inforns the compiler of a list of 
System Elements which are dependent or subordinate to the 
System Element fcliowing the header declaration in a compile 
or load-tine System. 

The EXECUTIVE statement functicn informs the 
compiler that the fcllowing System Element or Elements are 
allowed to generate control memory references tc index reg- 
isters and accumulators uSing the symbolic identifier speci- 
fied in the System index (register,name) declaration. 

These statements will not be necessary if a multi- 
level compiiing concept is implemented. Instead what is re- 
guired is a declaration which informs the compiler cf the 
program level and project type of the fcllowing Element or 
Elements. . 

4, SPILL Statement 

The SEILL statement causes every identifier within 
a System Frocedure to be declared as an External Definition. 
This statement has not been implemented Since its capa- 
bility can be achieved by relativley simple pregrar changes. 

Jo ubmary 

In summary, the header declarations should be used 
to inform the compiler of the type of Element aliccaticn 
(static or relocatable) and the type of output reguired for 
the compile and load-time Systems. They should nct be used 
to medify the Elenent's internal data allocatien and control 
Structure representation. Header statements are also used to 
specify library retrieval-correction, debug, and data design 
pooling changes to the compile or load-time Systems! default 


parameters. 
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CeeeeLALA STRUGTU HE 

The data structure of a program must allow a compact 
definition of a data unit's attributes and provide for fre-~ 
setting it to a constant or Tag value. The following changes 
have been made to simplify CNS-2RS. 

1. Data Types 

The data types availabie in CMS-2 do not expiicitly 
includgeé a type that allcws operaticns on the matnematical set 
of binary digits. The data type Boolean is used primarily 
for the logical operations of conjunction, disjunction, and 
negaticn. Therefore, it has been limited to a fixed length 
set which is defined as either True or False and no longer 
can ke used.for bitstring manipulation. 

TO provide a Bitstring capability, a nex data tyrpe 
has keen defined, called Bits, which allows the lagical 
operaticns of “and", "or", complementation, aS weil as SHIFT 
and sukstring functional oferations. These operations pro- 
Vide the capability for programming at the bit level in 
additicn to programming at other data tyfe levels. 

The data type Integer, as well as Fixed-point and 
Floating-point, are now treated as Reals and may cnly be used 
in Real expressicns. 

2. Data Flements 

Ihe data element constructs in CMS-2RS provide a 
means of organizing data into the most ccmmonly used struc~ 
tures. with the exception of lists. 

a» Switches 

The Index switch declaration for statements is 
compesed of an index variable followed by a list of n state- 
ment labels to which control may transfered depending upon 
the value of the index (0 to n-1). An alternative form of 
the declaration contains two index variakles and a column of 
labels fcr each. The item switch declaration is composed of 
a variable followed by a list of constant statement-label 


pairs. 
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The Procedure switch declaration iS composed of 
procedure names to which calis may be made, and a list of 
forall input and output parameters for each procedure. 

The statement Switch and precedure switch 
Capakilities have been incorporated intc the CASE state~ 
ment. 

b. DATA and MODE Declarations 

The DATA declaration has been incorporated 
into the data unit preset capability in the data designs. 
To facilitate this, a TAG declaration has been added which 
allows the programmer to aSSign a constant value to an 
identifier which can be used throughout a compile-time 
systen. ; 
The MODE declaration, which speciifes that sStb- 
seguent data declarations have the same data tyfe, has been 
superceded by the reguirement that type specifications appear 
in every data declaration. 

ce Variables 

The precision specification for variaties is ne 
longer cptional. It must now be specified and each code 
generator wiii translate it into the appropriate length for 
its target machine. 

The V(x,y) specification reguires the ccmpiler 
to preset-scale a variable starting at the y bit position 
by defining that position with a magnitude of x and each sub- 
seguent position as one half the preceding positions! value. 
This capability has been provided by the Bits data type 
€XPressicns. 

d. Tables 

The option tc store one dimensional Tables in 
either vertical or horizontal alignment has not reen imple- 
mented in fayor of aligning all Items vertically (all words 
stored sequentially). An optimum Search capability is now 
Frovided Fy the compiler that allows Table searching to tLe 


independent of Item word storage alignment. 
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The Optional counter (major index), which keeps 
track of the actual number of items within the takle, has 
not been implemented. This feature was programmer maintained 
and coulda just as easily be accomplished by a normally de- 
clared integer variable. 

Like-tables have not been implemented since the 
same capability can be achieved by declaring either another 
Table with a different number of items cr a Sub-table with 
the desired dimensions. 

The Supb-table ccncept has been extended to in- 
clude arrays or muiti~dimensional Tables, as well as one 
dirensicnal Tables. 

‘ The INDIRECT Tatle implementation has been 
simtlified ty rot recuiring the CORAD functional operator 
in making procedure calls by address. The fact that the 
formal input parameter is an INDIRECT Table name cr that the 
corresponding formal and actual input parameters ate of 
the same name is sufficient ta achieve a procedtre call in 
Which no values are passed. Further, the INDIRECT cption 
may not be used aS an actual parameter. 

The Field declaration has keen changed to allow 
only tyre "b" word packing, that is the compiler assembles 
the Fields within Table Items sequentially in the order in 
are declared. 

En Overlay 

The Overlay declaration has not been impiemented 
since dynamic packing of a variable with a list cf variables 
can ke accomplished by the PACKIN instruction. The static 
packing of variables is of guestionable value in a language 
with a wessage processing capability and requires an un- 
necessarily complicated compiler data declaraticn capability. 
If a list processing capability is required, it should he 
added as a complete feature of the language and compiler 
including list declarations and logical set operatcrs such 


as union, intersection, add-to, and delete-fronm. 
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De CONTROL STRUCTURE 
The CMS-2RS control structure provides the cafability to 
specify parallel as well aS sequential frocesses. Further, 
a structure has been provided which groups statements into 
blieccks for seguential prccessing. This section describes 
the control statements whith have keen added te the CMS-2KS 
language to implement the above capabilities. Additional 
changes have been made tec provide the capability to implement 
the structured and multi-level programming concerts. 
1. BEGIN and END Declaratives 
The BEGIN and END block structure declatatives, which 
are described by Bauer in reference 7, have been imple-~ 
mented to allow grouping of statements for Sequential exe- 
cuticn. There is no imposed limit on the numbexr cf levels 
ot Elock structure that can be nested within a System Pro- 
cedure since data deSigns are declared separately fron 
eoncrol]l structures. 
2. VARY Statement 
The VARY Statement has been changed to ailow an 
cpticnal increment value (other than 1) for each index 
Variable on the same lcop level. The loop execution will 
cease cnly when all indexes have reached their test value. 
3. CASE Statement 
The CASE statement, which is described by Bauer in 
reference 7, has been added to the language to frevide the 
capability for seiective statement execution. A case index 
value is compared at run-time to a list of case labels to 
determine which statement will be executed. Each statement 
in a case block may have more than one label and labels may 
be eitaer Integer or Status constants. An ELSE case cafa- 
bility has also been provided in the event the index value 
falls outside the range of the case label list. An alternate 
version of the CASE statement has been implemented which 
does not require statement labels. An implied lakeling seq- 
uence of 0 to n-1 1S used to select one of n Statements in 


a case LElock. 





4. While Statement 

The While statement has been added to frecvide for 
locrE controlied execution of a statement or biock of state- 
ments based upon the continued True condition cf a Boolean 
EX PpLESSicn. 

5. IF Statement 

The IF statement has been modified to aliow nesting 
of IF THEN ELSE statements. Only simple (not CASE, VARY, 
WHILE, or IF) statements are allowed within the nesting 
structure. All types of statements, however, may be used 
for the last statement after the last ELSE. 

The "JF DATA FOUND THEN" and "IF DATA NCTECUND 
THEN" statements have not been implemented since the in~ 
dicies cf a Table or File in a SEARCH statement will be set 
to minus one if the search is unsuccessful, otherwise they 
will be set to the location of the desired data unit. 

The "TF (subscripted data structure) VALID/SINVALID 
THEN" statements have not been inplemented since the Boolean 
functicnal operator SUBSCRIPT, with parameter subscripted 
data structure, provides a True of False indication of sub- 
script range validity, and can be implemented by using it 
Within an "IF THEN ELSE" statement. 

The "IF (data unit) OLDDP/EVENP THEN" statements 
have not been itplementedad since the COUNT(data unit) func- 
ticnal operator provides the same capability when checked 
for €ven or odd value in an IF statement. 

6. Label Declaratives 

Labels are alloyed on every statement type except 
statements within VARY, CASE, RESERVE, and WHILE statement 
blocks. Multi-level programming, however, may ke used to 
restrict their use to specified program levels. 

7. SEARCH Statement 

The SEARCH and FIND statements have been con- 
bined tc simplify searching a Table or File structure fcr 
a particular data unit. If found, the index variables in 


the data sturcture's dimension list will be set at run 
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time to point to the data unit's locaticn, otherwise they 
will ke set to minuS one. 
8. UNPCK Statement 
The UNPCK statement has been implemented to allow 
a data unit to be unpacked into a list cf data units in 
seguence from left-to-right. If the combined length of the 
receiving data units is longer than the source, then they 
are filled with zeroes. 
G9. Frocedure CALL Statement 
The procedure CALL statement has been changed to 
allow calis by value, value-result, result, address, and 
address-result. Ail input and output actual parameters 
are cptional and need not agree in numkeéer, but gust agree 
in type. An expression may be used aS an input actual para- 
meter but only a data unit may be used as an output actual 
Farameter. 
10. GOTO, RETURNTO, and RESUME, Statements 
The GOTO, RETURNTO, and RESUME statements have keen 
implemented to allow transfers only to labels within a 
System Procedure. Tne GOTO and RETURNTC statements, with 
special hardware condition checks, have not been implemented 
Since the same capability is available by using the INTERRUPT 
Poolean functionai operator within an IF THEN GCIC statement. 
The GOTC statement with an Item or Index-switch name check 
has not been implemented Since the Same capability is avail- 
abie with the CASE statement. 
11. RETURN Statement 
The RETURN statement has been changed to allow trans- 
fer with a data unit instead of transfer with the result of 
an evaluated expression. 
1z. INPUT and OUTPUT Statements 
The INPUT statement has been inplemented to alicw 
Loving of Character and Binary records from READ, OCM 
(Operator communication medium), and user declared Files to 
the following user specified receiving data units: Tables, 
Pields, Items, or variables. The OUTPUT statement has been 


implemented in a similar manner for PRINT, PUNCH, OCM, and 
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user declared Files. Both statements allow data to he 
moved back and forth frem internal to external structures. 
13. EXEC Statement 
The EXEC Statement has not been implemented since a 
normal call tc a procedure, with the proper input parameters 
and at the appropriate programming level, accomplishes the 
same effect. 
14. STOP Statement 
The STOP statement, with a hardware special condi- 
tion check, has not been inplemented Since the Same capa- 
bility is available by using the INTERRUPT functicnal oper- 
ator in an IF statement. 
15. COBEGIN and COEND Declaratives 
The COBEGIN and COEND declaratives which are des- 
Cribed by Hansen in reference 8, have keen implenented to 
allow simultaneous executicn by more than one processor of 
several statements including procedure calls. Each process 
way refer to shared rescurces such as variables, subscripted 
Variables, File records, and non-reentrant procedures only 
if their values are not changed. Reading from and adding 
to the end of data structures, however, are legal accesses 
to a shared resource. 
GOTO statements are not allowed within a COBEGIN- 
COEFND bicck to prevent errors arising frcem the transfer of 
control from one process tc another. in additicn, COBEGIN 
and COEND declaratives may be nested to any level within a 
System Frocedure or Reentrant Procedure. Thus, they may 
be used to describe any combination of parallel and sequen- 
tial processes. 
16. RESERVE and WAIT Statements 
The KESERVE statement has keen added te frecvide a 
teans fcr the prcegrammer to specify critical sections in 
each process within a COBEFGIN-COEND block. The parameters 
for the KESERVE statement are those rescurces whichaA may 
be change~accessed by two cr more processes. Each resource 
thus specified, is identified as belonging to a particular 


Process. If another process desires access to the same 
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resource then that process will be engueued on a wait list 
for later access when the resource is freed. The WAIT state- 
ment has keen added to allcw synchronization of processes, 
that is, to specify that the controlling procedure is to 
suspend execution until one or more procedures have con- 
pleted their execution. These statements are intended to 
alicw the programmer to explicitly control the execution cf 
parallel and synchrcnized [frocesses. 

17. SET Statement 

The SET statement allows assignment of evaluated 
expressions and Table values to a compatible data unit or 
list cf units including other Tables. The data units allowed 
are variables, subScripted variables, and Tabies. Table 
asSignments must be size and Item-length conpatible or the 
desired result may not be obtained. Subscripted variables 
Rust Specify Field name, Table name, and dimensions. 

The "SET (receptacle (s)) TO (exfEression) THEN" 
statement has not been implemented Since the sane capability 
is available by using an IF THEN SET statement. 

The "SET (receptacie(s)) TO (expression) SAVING 
(Real data unit) OVERFLOW ({iabel)" statement has not been 
implemented since the same capability is availakle by using 
the KEM operator within an IF THEN GOTO statement. 


a EXPRESSIONS 

The expression tyres implemented in CMS-25ES include 
Real, Bits, Status, Character, and Boolean. Real E€xfpressicns 
include Integer, Fixed-foint, and Floating-point data types. 

1, Real 

In Real expressions, a binary cperator has been 
addec which gives the remainder after division of two real 
Frimaries (REM). In addition, two functional operators 
have keen added: The operator NUMBER, with parameter Bits 
primary, converts a Bits value into an Integer value; and 
the operator CHAKCODE, with parameter Character frimary, 
converts a character syrbcol code into an internal integer 


repLesentation. 
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The type conversicn conventions for mixed operand 
exEressions are, 

a. Decimal and cctal operands will be converted 
to decimal results. 

b. Integer and Fixed-point operands will be con- 
verted to Fixed-point results. 

c. Integer and Floating-point operands will be 
converted to Floating-point results. 

d. Fixed-point and Floating-foint operands will 
be ccnverted to Floating-point results. 

Where the receptacle is of smaller precision than 
the evaluated expression, the next least significant digit 
Will be rounded and the remaining digits truncated. 

z. Bits 

The SHIFT statement has beer replaced Fy the follcw- 
ing unary operators in Bits expressions: shift left-logical 
(SHLL), shift right-logical (SHRL), circular end-arcund 
shift left (CIRSHLL), and circular end-around shift right 
(CIRSHRL). 

{The binary operators, logical and (ANDL), lcgical 
or (ORL), and logical complementation (NCTL), have been 
implemented to allow logical operations on Bits cferands. 
If the cperands are of uneyual length, the shorter cne is 
Light justfied and filled with zeroes cn the left before a 
binary operation is completed. 

_ fhe functional oferator BIT has been renamed 
BITSIRING, but performs the same functicn of ccnverting 
a Real frinary to a Bits value and extracting a specified 
number of binary digits from it starting at any fcsition in 
the string. 

Ss Character 

Cne functional operator has been added tc Character 
e€xfressicns. The CODECHAR operator, with parameter Real 
primary, converts an integer value intc a single character 
Symkcl code. The CHAR operator has been renamed SUECHAR, 


but performs the same function of extracting a specified 
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number of character symbols from a Character primary start- 
ing at any character position in the string. 
4. Boolean | 

The unary operator COMP or comflementaticn has been 
renamed NOT. The binary operators alicwed in Eccolean expre- 
ssicons are conjunction (AND) and disjunction (OR). 

The comparison Operators in relaticna] exfressicns 
have been changed to: equal (=), not equal (-=), less than 
(<), greater than (>), less than or equal (<=), and greater 
than or equal (>=). fhe following restrictions are placed on 
operand ccmpariscns: keal operands are converted to the same 
type before they are compared, Bits operands of all lengths 
are compared from right-to-left and only for equal or not 
equal, Character operands are compared from left-to-right 
for ali léngths with longer strings having a value of greater 
than if all other characters are equal, and Status operands 
are compared by their Integer value. Ina Status variable 
declaration, the first Status constant associated with the 
Variakle is assigned a value of zero increasing Ly incre- 
ments of one for each additional Status constant. 

If an expression involves operations of the same pri- 
ority they are performed from left-to-right. In parenthe- 
Sized expressions, the innermost fparenthesized expressions 
acre evaluatea tirst. A hierarchy of evaluation is Shown in 
Table xX. 
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PRICRITY OPERATOR DEFINITION 


ae ee ee ee | ee eet eee ee 


1 ABS,7 ABSOLUTE VALUE, UNARY MINUS 
CAL’ STRING CONCATENATION 
NOTL > aa BITSTRING COMPLEMENTATION 
iz *% EXPONENTIATION 
SHLL SHIFT LEFT LOGICAL 
SHRL SHIFT RIGHT LOGICAL 
CIRS ALL CIRCULAR SHLL 
CIRSHRL CiRCULAR SHRL 
3 *.f/ MULTIPLICATION, OCLVISION 
REM REMAINDER AFTER DIVISION 
ANDL LCGICAL AND 
ORL LOGICAL OR 
4q | es ADDITICN, SUBTRACTION 
3 UAL, NOT EQUAL 


_ oH 
<)>= LESS THAN, GREATER THAN OR EQUAL 
> <= GREATER THAN, IESS THAN CK ECUAL 


6 NOT NEGATION 
v AND CONJUNCTION 
OR DISJUNCTION 


= 


TABLE X - CMS-2RS MIXED EXPRESSION EVALUATICN 
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IV. LANGUAGE ANALYSIS 


The following sections describe Simple left-to-right 
parsing, with k symbols of look-ahead, Ccntext-Free gran- 
mars (SLR (k)), an SLR(1) parsing algorithm, and SLR(1) syntax 
analysis of CMS-2RS. 


A. SIR(k) CCNTEXT-FREE GRAMMARS 

The class cf LE(K) grammars is the largest class of 
grammars which can be naturally parsed left-to-right and 
bettcm-up using a deterministic pushdown automatch. Among 
the various sub-classes of LR(K) grammars are precedence 
and Founded-right-context grammars. Since LR({k) grammars 
have these properties, CMS-25S was defined as an SLR(1) 
context-free grammar. This section describes context- 
fiee grammars and SLR(kK) graitiars aS presented ry reif- 
erences 10 and 9. 

1. Grammar 

A Grammar is defined by a quadruple of sets of ter- 

Minal symbols (VT), non-terminal symbols (VN), a start symbol 
(S), and productions (P). The letters VT denote a finite 
set cf terminal cr non~reduceable symbols from the vocabu- 
lary (V) of the grammar. The letters VN denote a finite 
set cf non-terminal or variable symbols from which various 
strings of terminals and non-terminals can be derived. 
The letter S denotes the start non-terminal symukcl from 
Which all strings of terminals in the grammar are derived. 
The letter P denotes the finite set of froducticns cr 
relaticns between the left part symbols in VN and the right 
part strings of symbols in V*, where V=VN U VT and V* de- 
notes the set of all strings composed of symbols in V 
including the empty string. 

z- Context-free Gratnars 


A Context-free grammar 1S one in which €very fro- 
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duction is of the form v=>W, where v 1S a Single variable 
and w is any string other than tne empty string. 
3. Direct Production or Reduction 
| For a grammar G, a string v Directly Froduces a 
string w or w Directly Keduces to v, written v=>w, Beans: 
if U=>u is a rule of G, and x and y are strings which may 
be empty, then v=xUy, and w=xuy. 
4G. Eroducticn or Reduction 
A string v produces a string w cr w reduces to v, 
written v=>+w, means: if n>0O and nis the number cf produc- 
tions frem v to w, then v=u(0)=>u (1) =>u(Z) =>...=>u(n) =w, 
where it v=wW or v=>+w then v=>*w, and u(i) denotes the ith 
production. 
5. Sententiai Form 
A string x is called a sentential form if x is derivabie 
from the start symbol S, written S=>*x. 
6. Sentence . 
A sentence iS a sentential form consisting only of 
terminal symbols. 
7. Language 
The language of the grammar, written L(G(S)), is the 
set cf sentences: L(G) =(xj|{S=>*x and x is in VT+t). 
€. Ehrase 
If w=xuy is a sentential form then u is called a 
Fhrase cf the sentential fcrm w for the non-terminal U if 
U=>+u and S=>*xUy. In addition, u is calied a sinm;zle phrase 
if S=>*xUy and U=>u. That is, not only must a reduction be 
possible using a production in the grammar, but the reduced 
sentential form must also be derivable from the start symbcl. 
In addition, u is called a simple phrase if S=>*xUy and U=>u, 
that is, not only must a reduction be possible using a pro- 
ducticn in the grammar, but also the reduced sentential form 
must be derjvable from the start symbol. 
9. Canonicai Derivation 
A direct derivation xUy=>xuy is canonical, written 


xUy=j>xuy, if y contains only terminals. A derivation w=>+v 
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is canonical, written w=jJ>v, if every direct derivation in 
it is canonical. 
10. Farse 
A Parse of a sentential form is the construction of 
ifs derivation. 
11. Left-Right, Bottcn-Up Parse 
In a left-right, bottom-up parse, the leftmdst 
Simple phrase (the Handle) of the current sentential form is 
always reduced. fThus, the string to the right cf this 
phrase always contains only terminals. 
1z. Rightmost Derivation 
A rightmost derivation is one in which at each stef 
the rightmost variable in the sentential form is replaced. 
13. $LR(K) Grammar 
A Context-Free grammar is LR(kK) if for any sentential 
form w=xuy the foliowing holds: there is a unigue form for 
Ww, and there is a rightmost derivation S=>*xUy=>xuy, where 
U is replaced by u at the last step, and U and u can be 
uniguely determined by scanning w fromr left-to-right up tec 


a pcint at most k symbols beyond u. 


Be. SLR(k) PARSING ALGORITHM 
This section describes Context-free LR (0) and SHR) 
grammag parsing as presented by reference 9. 
ite Context-free Parsing 
Let G be a Context-Free grammar, and suffrcse that 
the predtctions P are numbered 1,2,...,f. Let V te in the 
Seem VI U VN*. Then: 
ae A left parse of V is a seguence of productions 
used in a leftmost derivation of v frem S. 
b. A right parse of v is the reverse of a Sequence 
of froductions used in a rightmost derivation cf v from S in G. 


Consider the grammar where the productions are numbered 
as shown: 
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aw IRR 
J 6) 5p SY ae 

C cee lees. 3 
d.  Pi:=F--4 
Comes. = C1) =o 


fie als SSO 
The left parse of the sentence a*¥(ata) is 23465124646. 
The right parse is 64642641532. The right most déerivaticn 


from E is: 


als =SEtE:1 

te =PEtLT*F 33 
Ce =>EtT*a:6 
d. =PDEtF¥*az4 
iz =>Eta*a:6 
Gg. =>Tt+a*ai2 
ne =>Fta*¥as4 
le =Pata*a:6 


| Writing in reverse, the sequence of predguctions used 
in this derivaticn gives the right parse 64266431. In geéen- 
erai a Laight parse for a string v in a grammar G iS a seq- 
uence cf productions which can be used to reduce v to the 
Start sytrol S. 

The parsing proceeds using essentially a Light parser 
cycling through ali possible rightmost derivaticns, in re- 
verse, that are consistent with the input. A mcve consists 
of scanning the string on top of the pushdown stack to see 
if there is a right side of a production that matches the 
Symbols on the top of the stack. If so, a reduction is made, 
Ceplacing these symbols by the symbol on the left side of 
the production. 

| If no reduction is pessible, then the next input 
Symbcl is placed on the pushdown stack and another attemft 
1s wade tc reduce the stack. This process continues until 


the grammar has been parsed or an error occurs. 
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Ze LR(0) and SLKR(1) Parsing 

In reference 9, De Remer states that to ccnstruct 
an LR(0O) parser for a Ccntext-Free grammar G, configuration 
sets must be computed. A configuration set represents a 
state of the parse, that is, which parts of which produc- 
tions may have been used to generate the input string to 
the point of the parse. 

Each non-empty configuraticn set has one cr more 
SUCCESSCLES OL configuration sets. In general, a configu- 
ration set $(1i) has an S-successor for each symkcli §s in 
V that is preceded by a marker in one or more of S(1)'s 
configurations. A marker iS a pointer to the next fossible 
symbcl to be read in the input list. 

Ahl S~Successor state consists of a basis set combined 
with a closure Set. The basis set consists of all config~ 
uraticns in S$(i) having a marker before an s, but with the 
marker moved to follow the S(i). The cloSure set is defined 
+h 


OL CONSIGgutdercius cna ec 


GE 


recursively to be the largest se 


until a terminal symbol 


ct 


can ke derived from the basis Se 
is reached. 

An LR(O) parser for a grammar G iS represented by 
the set cf all configuration sets, where each set is a state 
of the parse with an accessing symbol, and a list of possible 
symbols which can be read next with an indicaticn of whether 
to read the next Symbol and go on to ancther state, or to 
reduce and go on to another state. 

A Context-Free grammar is aad ‘to be abn i) ie 
and only if each of the inadéguate states of its successcr 
states has mutually disjoint (simple) 1-symbol lock-ahead 
sets which allow the parser to determine which reduction 
to make. An inadequate state is one in which the parser 
must look ahead k symbols in order to determine which re- 
duction to make. 

An SLR(1) parser for a grammar, therefore, is repre- 
Sented by the same set of configuraticn sets as is re- 
guired for an LR(0) parser with the addition of the simple 


1-lcck-ahead sets. 
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SLR(1) parsers make a decision to reduce kased ufen 
all the symbols in the parse stack plus one nore from the 
input text. The parse is accomplished Ey restructuring 
the stack after each reduction and Saving the state of the 
parse to allow the parser to know where it has keen so that 


it can make the right decision for the next reduction. 


Meeeosik (i) SYNTAX ANALYSUS OF CHS-2RS 

fhe SLR(1) Syntax Analyzer and parsing table cenerator 
produced by Woods and described in reference 11 was chosen 
to analyze the grammar because of its speed of executicn. 

In develofiag an SLR(1) grammar for CMS-2RS, it was 
necessary to write a complete initial description of the 
language and then take Small secticns cf the gramuaar and 
analyze them until they were SLR(1). In that fashion, 
the grammar was sSuccesSively built up and revised until a 
complete SLR(1) grammar was obtained. The execution tine, 
using Wcod's PL-360 based Syntax Analyzer, was cn the order 
of 9 to 10 seconds of CPU time for about 200 productions 
On an IBM-360 Model 65 computer. 

Curing the grammar analysis it became apparent that con- 
bining the same sSub-strings of symbols cn the right part cf 
several productions intc new productions greatly reduced 
the number or terminal transitions. This allowed a larger 
gcammar to be handied for the same table sizes in Wood's 
Syntax Analyzer. The resultant grammar is thus highly op- 
timized in terms of parsing table size: The limit cf 255 
producticns in the Syntax Analyzer, however, did require 
that the revised grammar be split. The data declarations 
and header declarations were handled by separate grammars 
Since they are blocked by easily recognizable beginning and 
Ending delimiters. The implementation of this split and 


its implications are discussed in Secticn V. 
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F DHE IWO-PASS CMS~2RS COMPILE 


The purpcse of this project is to take the first step 
tcward a practical implementation of CMS-2RS. A two-pass 
compiler was selected taking into consideration main memory 
requirements and project modularity. Also, having twe 
passes cffers a potential for ccde optimization in the 
seccnd pass. This approach facilitates the task of making a 
significant start on a useful compiler in a limited time by 
isolating the analysis and synthesis functions from the 
detailed ccde generation, storage assignment, and interface 


handling functions. 


fee FLRST FASS \ 


The first pass performs four basic functions. [It scans 
the characters of the source program, builds the symtcl 
table, parses program sentences, and generates an interme- 


diate language representation of the source program. 


1. iexicai Analyzer 
texX1cdasi ANALYZer 


= 


The lexical analysis 1s ferformed by the procedure 
SCANNER. In writing the scanner's case statement, it was 
decided tc use a different case for each delimiter (except 
for "*#*t and “a= which are handled with "*" and "A" re- 
Spectively). By doing this, there is no need for having a 
table of ail the tokens in the language and the corresfond- 
ing table lookup mechanisn. SCANNER does all the symbol 
table lookup and enter operaticns required by the compiler. 
When an identifier 1s scanned, the symbol takle is searched 
to determine if this identifier is a reserved word or if it 
1s an already entered name. If not found, the new name is 
entered in the symbol table. From that point on each 
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identifier is represented py its index into the symbol table 
so that ne further search is ever needed. 

As will be explained later, two independent analyzer 
procedures are used to parse the declarations and the 
dynamic statements. For this reason it became necessary to 
build a linkage between SCANNER and these two analyzers that 
would allcw the implementation of the necessary switching 
mechanisms. The linking functicn is performed by the frece- 
dure SCAN. SCAN is called by the analyzers when the next 
tcken is desired. SCAN calis SCANNER to oktain the next 
symktol in the input stream. When SCANNER returns certain 
reserved words, SCAN switches control to the appropriate 
analyzer. Fer those tokens that appear in beth grammars, 
SCAN alicws for two different internal representaticns. 

SCAN alsc facilitates the implementation of certain symbol 


table handling mechanisms as will be explained later. 


2. Symbol Tabie Design 


———— 


After ail memory allocations tc variables used by 
the compiler are satisfied, the remainer of the assigned 
region is allocated to the symbol table. The symhol table 
is divided into two main parts: the identifier -directcry and 
the constant table. 

ae Block ‘Structured Identifier Directory 

The following methcd was chosen to implement the 
two level (global, local) block structure cf CMS-2 uSing 
hash addressing for table search. Every identifier is 
ccncatenated to a two character prefix which indicates the 
devel of such identifier. Prefix "00" correspond tc glockal 
scope, while all other prefixes fren "01" to "99" correspond 
to local scofe. Therefore, every identifier declared ina 
SYS_DD is entered in the identifier directory with frefix 
"OO". For example, variable "ABLE" would be known to the 
compiler as "QOABLE". A block counter is maintained and 


eCach time the reserved words SY¥S_PROC and SYS_PROC_R are 
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encountered this counter is incremented and the current 
prefix is correspondly altered. Thus, variables declared 
within LOC_DD's or AUTO_DD's are entered in the directory 
with the prefix corresponding to the sequential numker of 
the SY¥YS_PROC where these data designs appear. When SCANNER 
reccgnizes an identifier, it first looks for its presence in 
the identifier directory with the current prefix and, if not 
found, then with the global prefix ("00"). 

b. Hash Addressing | 

The chained hash addressing technique described 
by Gries [Ref. 12] is used to access the identifier direc- 
tory. A hash table size of 1229 was chosen to minimize the 
number of collisions in medium Size programs. This size can 
be changed te any desired prime number, requiring only one 
minor change in the froceaure HASH. The hashing scheme uses 
the number af characters in the word, the second and third 
Characters, and the last and next to the last characters as 
arguments. ‘the hashing function is applied to the identi- 
fier after it has been concatenated to the proper prefix as 
previously discussed. Procedures RESERWRD and LOOKUP 
handle all the required symbol table searching. The 1ist of 
reserved words is preloaded in the identifier directcry by 
the procedure INITIALIZE. No prefix is concatenated tith 
Teserved words. In addition, no collisions occur when 
hashing reserved words, so RESERWRD needs only to compare 
the identifier scanned with one entry in the directory to 
find if it is a reserved word. Caution must be exercised if 
the size of the hash table is changed or if the languace is 
cLéevised to include new reserved words to insure that this 
prorerty is preserved. 

Procedure LOOKUP follows the basic flow chart 
fcund in page 222 of Ref. 12. LOOKUP adds the current block 
humrer prefix to the identifier and then hashes it and 
Searches the identifier directory. If the identifier is not 


found then LOOKUP tries again using prefix "00." If still 
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net fcund, the identifier iS entered in the directory with 
the current prefix. : 
c. Identifier Directory Design 

The identifier directory was designed taking in 
ccnsideration the large variety of descriptors required to 
represent the wide vatiety of conditions encountered in the 
language. Appendix D explains the descriptor formats in 
detail. The descriptors are stored in the constant table; 
hence, the identifier directory can have a simple and 
uniform structure. Each identifier directory.entry uses 15 
bytes distributed in four fields as shown below. 

(1) Ten bytes are reserved in the identifier's 

name field to acccmodate a maximum of ten 

characters (eight characters from the scurce 

plus two in the prefix). 

(2) One byte is used for the semantic ccde. 

fhe semantic code is an integer from 1 tc 148 

that identifies the teken recognized Ly SCANNER 

to procedure SCAN. 

€3) Two bytes hold the chain field. fhis 

field is used in case of collisions to stcre a 

Fointer ccnnecting entries with the same value. 

C4) Two bytes are used for the descriptor 

pointer field. This field contains a pointer to 

the constant table where the descriptor 

containing all the attributes of this identifier 

is stored. 

d. Constant Table Design 

{The constant table is essentially a large array 
used to store constant values as they are recognized Ey the 
scanner and descriftors for identifiers as freviously 
indicated. then procedure SCANNER scans a constant, it 
stores it in the constant table. This includes character 
ccnstants as well as numeric or bits constants. The first 
byte of every entry in the constant table is used to store 


the length of the entry , i.e., the number of bytes that 
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follow. In the case of numeric constants, the need arises 
tc differentiate between the six different types of numeric 
constants used in the language. In order to do this, the 
low order five bits ot the header byte are used to indicate 
the length of the entry as explained above. The remaining 
three high order bits are used to indicate the type cf 
numeric constant accecrding to the following code: 

000 for octal integer 

001 for octal fixed 

010 for octal real 

011 for decimal integer 

100 for decimal fixed 


101 for decimal real. 


3. Syntax Analyzers 


fhe syntactic analysis is performed Ly the precce- 
dures ANALYZE and BANALYZE. ANALYZE is the main grammar 
parser. It is calied once by procedure MAIN and returns 
Only When the source program has been completely parsed. 
ANALYZE eails SCAN each time a new token is to be read. 
When SCAN reccgnizes any of the reserved words SYS_DD, 
AUTO CD, LOC LD, PROCEDURE, or FUNCTION it calls EANALYZE 

which is the second parser, BANALYZE nandles the 

declaraticns grammar. It also uses SCAN in the same nanner 
as ANALYZE. When BANALYZE compietes the parsing of a data 
design or a procedure or function declaration it returns to 
the ~lace where it was called by SCAN, and SCAN passes the 
corresponding token to ANALYZE which continues execution. 
With this scheme, whole declarations are Seen as a Single 
terminal symkrol by the main grammar. This ccncept, of 
course, can be extended to any number of parsers. FOL 
example, when a language is rather extensive (a classic 
exanple wcula be PLyI), it may become extremely difficult to 
develop a single grammar of a given type. This job is 


greatly simplified by dividing the task intc a main grammar 
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and séveral c¢mponent grammars. In additicn, the most 
aPpprcopriate parser can be used for each component grammar. 
For example, a component grammar could handle expressions 
and use an operator precedence parser. Using the interme- 
diate linkage concept between the scanner and the syntax 
analyzers as explained before, all that is needed is a 
special symkcl to delimit each component grammar. These 
special symbcls can be part of the syntax of the ianguage or 
they can be added to the input stream by the compiler 
itself. 


4. cEackus~Naur Form (BNF) 


Appendix A contains a complete listing, using 
Rackus-Naur Form notation, of the SLR(1), ccentext free 
grammar developed for the CMS-2RS language. The grammar is 
divided into a main grammar and a component grammar. The 
symtoils <SYS DATA DECL>, <ROQCAL DATA DECL>, <AUTO DATA DECL> 
and <SUB_RKOUTINE DECL> are treated in the main grammar as 
terminal symbols. The component grammar handles these 
declaraticns. The mnéchanisms te implement this scheme are 


explained in Section 3 above. 


5. Intermediate Language (IL) 


Generation of the object frograr in an intermediate 
language form iS One of the primary tasks of the first fass. 
Due to time limitations, the generation of intermediate 
language could not be implemented. An IL format that could 
be used in future work on this ccmpiler is froposed in this 
secticn. 

ae iL Format | 

Each entry in the IL table consist of three 
Fytes. The tigh crder seven bits cf the first byte are used 
tc represent the operation code. the low order remaining 
bit is used to indicate an indirect operation. A set of 


Operation codes with their meanings iS Shown in Appendix E. 


63 





The codes shown Should be sufficient to represent the entire 
CMS-2KS language. The second and third bytes are used for 
the operand field. In most cases this field will contain 
the address ci the operand's entry in the identifier direc- 


tcry cr the constant table. 


6. First Fass Cutput 


Tre first fass output should consist of two disk 
files, one fcr the symbol tabie and another for the interme- 
diate lancuage generated by the dynamic statements. Future 
wcrk required to complete the first pass includes IL genera- 
tion, and the writing of two rcutines, cne to write the 
symbol table into a disk file and another to write an IL 


kuffer into a second file. 


Be SECOND PASS 


The main task performed by the second pass will kre the 
translaticn cf intermediate language into machine language 
(ML). Routines must be déveloped to read the IL file and 
produce ML which executes on the target machine. fTfhis 
routines must insert the ML necessary to ccmpute subscrift 
values, perfcrm data type conversion, map actual to fcrmal 
parameters, and implement dynamic allocation. Some code 
optimization capaoliity should also be inciuded here. 

Storage areas for variables and tables must be assigned 
in this fass. Routines must be provided that will process 
the symbcl table file and perform static allocation cr 
generate dynamic allocation mechanisms. This pass must 
alicw for linkage with other program modules. Communication 
links must ke established for external references. ‘MThese 
links shcyld be compatible with the requirements of the 


linking rccutines of the target machine's operating systen. 
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VI. CONCLUSIONS 


ee rents queen See ee ee ee ree cee 


Several attemps have been made, with less than optimal 
results, tc implement a universal language and compiler 
system. CMS-zZ is a system of this type, but it has met with 
limited success when used to implement large, self-ccntained 
computer systems. To get around the language limitations, 
extensive use of the direct code option has keen made. This 
practice defeats the purpose of a high level language and 
creates problems in system design and maintenance such as 
correctness of initial program logic and program segment 
integrity. if the potential of a high level language and 
compiler system are to be fully realized, there must be 
controls over which .anguage constructs are allowed at each 
level of software. Furthermore, insertion of direct ccde 
Should not ke allowed. The CMS-2RS system 1S a step in that 


direction. 


Ae. RESULTS 


The mcst apparent ambiguities, redundancies, and limi- 
tations which exist in the CMS-2 program, data, and ccntrol 
structures have been identified. Proposed corrections to 
these deficiencies have been incorporated in CMS-2RS. An 
SLR (i) ccntext free grammar has been defined for this 
language using BNF notation. This grammar 1s suitable for 
the construction of parsing tables uSing an automatic 
compiler generating system. Such a system, using Wood's 
SLR(1) ANALYZER, was employed to produce parsing tables for 
the first pass of the CMS-2RS compiler. 

In order to support the structured programming ccncerft 
in a modular library environment, changes where made to the 
external referencing mechanisms of CMS-2 so that external 
references are only allowed between ditferent compile-time 


Systems. Header declaration statements were also revised to 


65 





prevent mcdificaticns of an Element's internal data alloca- 
tion and control structure representation. This CMS-2 
construct facilitates the violation of program segment 
integrity and allows careless program deSign and 
maintenance. 

The concept of grammar segmentation was igplemented in 
the CMS-~2RS compiler. Data designs and dynamic statements 
were groufed into separate grammars which are parsed by 
different sections of the compiler. This concept is easily 
extendaple to other sections of the grammar such as exfpres- 


sions and header statements. 


Be. FUTURE WORK 


Several tasxs need to be completed in order to fully 
imsplement the CMS-2kS system. The work required to complete 
the first pass includes the completion of intermediate 
language design and the writing of the semantic routines 
reguirea for the translation of dynamic statements intc 
intermediate language form. Alsc, header and user file 
declarations need to be implemented. This wculd be 
accemplished by adding the header and file declaration 
statements to the declarations! grammar and writing the 
correspondig semantic routines. File manipulation operators 
Should be added to the main grammar. 

The next step is to write the second pass. The second 
pass must handle a variety of functions including mencry 
alilccaticn, data type convertion, subscript calculation and 
checking, parameter mapping, and code generaticn and ofti- 
mization. 

Further development efforts should include studies of 
the desirability of adding list processing constructs to the 
CMS-Z2kS language. 
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APPENDIX A 
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APPENDIX B 
DESCRIPTION OF THE CMS~2RS LANGUAGE 


The CMS-2RS language is described in the following 
sections uSing a modified metalinguistic notation which 
was originally developed by Eackus and is descriked in ref- 
erence 1zZ. This modified notation uses the following Easic 
symbols: 

::= A connective meaning “is defined tec fe", 

| A connective meaning “or defined to Fe", 
< > Delimiting brackets enclosing a netalinguistic 
variable. 

2% Uelimiting traces enclosing Lrackets meaning 
“more than one metalinguistic variarle", where 
each variable is separated by a comma, dollar 
Sign 3S}, or space*’as appropriate. 

The sections below describe the syntax and sé€mantics, 
and give €xamples for each structure or statement in the 
langvuagé. Precgram structure 1S described in Section 1. 
Section 2 describes the basic symbols and delimiters of the 
languace that are formed from the CMS-Z2KS alphaket. Data 
structures and referencing conventions are descrited by 
Section 3, followed by control structures in Section 4. 
Section 5 describes Real, Bits, Status, Character and 
Boolean expressions. The header and dekug statements were 
not i1goplemented in the present version of CMS~2FS and are 
not included in this appendix. In the examples telow, the 
notatior "<metalinguistic variable>" is used when the Iten 
has not yet been defined, but its inclusion is necessary 
for a ccuplete descripticn cf the example. References to 
imposed limits on the numbers of elements allowed in the 
Statements that follow are actually limitations inposed by 


CMHS~-zRS compiler and its table sizes. 
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1. Program Structure: 
ae System and System Elements 
Sylitas. 
<SYSTEM>3:=SYSTEM <IDENTIFIERSS 
{{<SYSTEM ELEMENT>Z ENDS 
<SYSTEM ELEHMENT>::=<HEADER DECLARATIOND>D 
I<SYSTEM DATA DECLARATION>D 
{<SYSTEM PROCEDURE CECLARATION>D 
1<SYSTEM REENTRANT PROCEDURE 
DECLARATION>D 
PEXTRGF <TDENEM FEE ROS 

Semantics: 

A System is a compile-time grouring of System Ele- 
ments, headers, debug statements, and externally referenced 
(EXTKEF) System Element identifiers. System Elements are 
€lther System Data, System Procedure, or System Reentrant 
Procedure declarations. The order in which Elements are 
listed is up te the prograrmer and specifies the Gesired 
order of compilation and iinking. A header may cnly be in- 
Cludé€d once at the beginning of a System. The total number 
of Elements allowed in a System is ninty~-nine (99). 

Examples: (see Section 1d.) 

bE. System Procedures and System Reentrant Precedures 
oy @ax. 
<SYSTEM PROCEDURE DECLARATION>::=SYS_ERCC <IDENTIFIER>S 
| §<SYSTEM PROCEDURE 
ELEWENT>? END$ 
<SYSTEM REENTRANT PROCEDURE DECLARATICNDS::=SYS_ERCC_R 
<IDENTIFIER>$S {<SYSTEY REENTRANT 
PROCEDURE ELEMENT>?¢ ENDS 

Semantics: 

Both tyres of System Procedures are the Easic build- 
ing blocks of the CMS-2RS language. Precedures may reference 
each other only through their prime procedures and are sejf- 
acrately ccmpileable and executable. Reentrant procedures 


consist of separate sections of data and non~changeable code. 
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Examples: (see Section 1d.) 
c. System Procedure Elements 
syntax: 
<SYSTEM EROCEDURE ELEMENTS::3=<LOCAL DATA DECLARATION>D 
{<AUTO DATA DECLARATIOND 
{<PROCEDUKE SUB-ROUTINE 
DECLARATION>D 
1<FUNCTION SUB-ROUI BNE 
DECLAKATION> 
<SYSTEH REENTRANT PROCEDURE ELEMENTS: 3=<AUTO DATA 
DECLARATION>D 
[<PROCEDURE SUB- 
KOUTINE LECLARATICN>D 
I<FUNCTION SUE- 
ROUTINE DECLARATION> 

Semantics: 

The elements in koth types of System Precedures 
may ke ordered in any manner, but Auto Data declarations 
must precede the applicable sub-routine declaration. Auto 
Data declarations are dynamically allocated at execution 
time, with a separate ccpy provided to each calling sub- 
routine. 

Examples: (see Section 1d.) 

d. Procedure and Function Sub-routine Declarations 

Sy Brats = 7 
SERCCEDURE SUE-ROUTINE CECLARATIONDS: S=PROCEDURE <IDENTI-~ 

FIER><INPUT PARAMETERS><OUTPUT FARAMETEFS><LABEL 

PARAMETERS>) $<STATEMENTS$S 
<INEUT EARAMETERS>: := ( § <VARIABLE>? 
<OUTEUT EARAMETERS>::=| §<VARIABLE>? 
<LABEFL EARAMETERS>: :={| $<LABEL NAME> @ 
<FUNCTICN SUE-ROUTINE DECLARATIONS: s=FUNCTION <DATA TYPED 

<IDENTIFIERD><INPUT PARAMETERS) $<STATEMENTOS 

Semantics: 

A Procedure may have any one, twce, three, cr none 
of the mempers of the set of input variakles, cutfut var- 


lables, and label names as formal parameters. Feerunct 1.O ny, 
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however, must have at least one input farameter. All input 
and outfut parameters must have been previously declared 
in data declarations, but Function names are declared 
when the Function 1s defined. Both Procedure and Function 
sub-routine declarations must include cne statement. There 
is no imposed limit on the number of sSuc~routine déclaraticns 
allowed within a System Procedure cr Reentrant FErocedure. 
Examples: 
SYSTEM ALPHA $4 
<BHEALER DECLARKATION>D 
<SYSTEM DATA DVECLARATION>D 
BATREF Aes 
Si Smr Rec mcms 
<LOCAL DATA DECLARKATION>D 
EROCEDURF E {IN| OUT] | LABEL) $ 
<STATEHENTD S$ 
<AUTO DATA DECLARATIOND 
FUNCTION C (IN1)$ 
<STATEMENTD$S 
ENDS 
SY¥S-PROC_R D$ 
<AUTO DATA LECLARATION>D 
PROCEDURE D |JOUT1]|LAEiL1) $ 
<STATEMENTSS 
<AUTO DATA LECLARATION> 
FUNCTION E (IN2)$ 
<STATEMENTOS 
ENDS 
ENDS 
€. Statements and Blocks 
Syntax: 
<STATEMENT>:: =<LABEL NAME>:<BASIC STATEMENT> 
|<EASIC STATEMENT> 
meyolkC STATENENT>S$:=<SIMPLE STATEMENTS 
I<VARY STATEMENTS 
I<CASE STAYTEMENT> 
[<HHL DE, STATEMENTS 
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|<IF STATEMENT> 


<SEMPLE STATEMENT>::=BEGIN $<STATEMENT>? END 
{COBEGIN $<STATENENT>€ COENL 
|<SET STATEMENT> 
{<SWAP STATEMENT> 
{<SEARCH STATEMENT> 
{<INPUT STATEMENT> 
|<OUTPUT STATEMENT> 
{<ENCODE STATEMENT> 
[<LECODE STATEMENT> 
{<PACK STATEMENT> 
|<UNECK STATEMENT> 
{<PROCEDURE STATEMENT> 
[<GOTO STATEMENT> 
{<RETURNIO STATEMENT> 
{<RESUME STATEMENT> 
[|<RESERVE STATEMENT> 
}<WAIT STATEMENT> 
{<RETURN STATENENT> 
{<STOP STATEMENT> 


Semantics: 


A statement directs some acticn or contrcls the ex- 


ecution of a group of statements. 
themselves be a statement, 
These goups are delimited 
BEGIN-END or COEEGIN-COEND and may 
BEGIN=END statenzents imply 
and COEEFGIN-CCEND statements imply 
COEBFEGIN-COEND statements, 
cially. 


grours. 


level. 


howeveL, 


The group of statements may 


thus allowing nesting cf statement 


by the reserved words 
be inter-nested to any 
sequential processing 
parallel processing. 


may also be processed sequen- 


All statements within COBEGIN-CCEND biocks may only 


Change-access mutually disjoint sets of data anc ccntrol 


Structures, that is, 


data or control structure. 
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Example; 
FEGIN 
<STATEMENT> 


<STATEMENT>S 
CCBEGIN 
<STATEMENT> 


<STATEMENT>D 
COENDS 
BEGIN 
<STATEMENT> 


* 
ea 


<STATEMENT> 
ENDS 


END$ 
2. Easic Symbols and Delimiters 
Ihe CMS~2RS alphabet consists cf letters, digits and 
Special symbols from the U.S. ASCII Standard Character Set. 


The following special symbols are elements of the alphabet: 


+ (plus) ) (cight parenthesis) 
- (minus) $ {dollar sign) 
| (slash) - (comma) 
* (asterisk) y (prime) 
5 (decimal point) | (bar) 
( (left parenthesis) (Space) 
< (left Eracket) — (underline) 
> (right bracket) ; (colon) 
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a Ne > ee 


- (equal) "6 6(guote) 
-~ (nct) & (ampersand) 
f- (pound) F, (percent) 
a. Identifiers and Reserved words 
syntax: 
SP VENTIFIERD 33=<hilTER> 
{<IDENTIFIERD><DECIMAL DIGIT> 
{<I DENT IFIER><LETTER> 
(<IDENTIFIER>_ 
<LETTERS: S=A{Bl...|2 
SeCrPMAL DIGIT>ss=T12}... {940 

Semantics: 

Identifiers are composed of Sequences ci letters 
and digits of any length, and must begin with a ietter. Oniy 
the first eight positions are used and hence twe different 
identifiers must ke unigue in the first eight positions. 
The blank space must be used as a delimiter between suc- 
cessive identifiers and may be used between deliniters and 
jdentifiers for clarity. 

Examples: 

ALPHA 

SYS_ PROC 

BETA1 

b. Real Constants 
Syntax: | 
<REAL- CCNSTANT> : :=<OCTAL CONSTANT> 
{<DECIMAL CONSTANT> 
<OCTAL CONSTANT>::=&6<OCTAL INTEGERD>D 
{S<OCTAL FIXED-POINT> 
{E<OCTAL FLOATING-~PCINT>D 
<OCTAL INTEGER>::= <OCTAL DIGIT> 
|<OCTAL INTEGERS<OCTAL DIGIT> 
<OCTAL DIGIT>::=O]{1f2]...[6{7 
<OCTAL FIXED-FOINTS::=<OCTAL INTEGERD. 
{ <OCTAL INTEGERD>D 
{<OCTAL INTEGER>.<OCTAL INTEGER> 
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SoC PAL RLOATANG—-POINTS$ s=<OCTAL INTEGMR>ESOCTAI EXPCNENTS 
 Y~OCTAL INTEGERDE<SIGNEL CCTAL 
EXPONZNT> 
|<OCTAL FIXED-EQINTDOEXCCTIAL EXPONENT> 
J<OCTAL FIX42D-POINT> &< SIGNED OCTAL> 
EXPONENT> 
<OCTAL EXPONENTS::3=<OCTAL DIGIT><CCTAL DIGIT> 
<SIGNHELD CCTAL EXPQNENT>::=+<OCTAL EXPONENT> 
{-<OCTAL EXPCNENT> 
<EXPONENT>:3:=<DECIMAL DIGIT><DECINAL DIGIT> 
<DECIMAL CONSTANTS 3:3=<DECIMAL INTEGER>D 
[<DECIMAL FIXED-POING> 
{<DECIMAL FLOATING-POINT> 
<DECIMAL INT£GERDS::= <DECIMAL DIGIT> 
<DECIMAIL FIXED-FCINTS: s=<LECIMNAL INTEGERD. 
[.<DECIMAL INTEGER> 
1<DECIMAL INTEGER>.<DECINAIL INTEGER> 
<DECIMAIL FLCATING-PCINTS: :=<DECINAL INTEGERD EC DECIMAL 
EXEFONENT2 | 
{<DECIMAL INTEGERDSE<SIGNED 
DECIMAL EXPONENT> 
[<DECIMAL FIXED-POINTSE<DECIMAL 
EXPONENT? 
{<DECIMAL FIXED-POINTOE<XSIGNEL 
DECIMAL EXPONENT) 
<DECINAL EXPONENT>: :=<DECIMAL DIGIT><DECIMAL DIGIT> 
<SIGNECD CLECIMAL EXPONENTS: :=+<DECIMAL EXPONENT> 
|-<DECIMAL EXPONENT> 
Semantics: 
The total number of digits and characters ( ., +, -, 


and —) allowed in a real corstant is twenty-two (22). 
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Examples 

1 

1.0 

Boe 2) 

ae 

2e9E+13 

3E10 

pekw~ 12 

+1 

~2.4 

&~7E+ 12 

&77 

c. Bits Constant 

3 ymia xs 

<BITS CONSTANTS: :=f¢ <HEXADECIMAL DIGIT> 
iSBLES CONSTANTOSHESALECIMAL DIGITS 
<HEXADECINAL DIGITS :2:=<RECINAL DIGITS jAyBJ...4F 

Semantics: 

The Bits constant is a Sequence of binary digits which 
is represented by a sequence of hexadecimal digits. Each 
digit represents four binary digits of cnes or zeroes. The 
numkex cf hexadecimal digits allowed in a Bits constant is 
twenty=two (22) including the pound sign @. 

Example; 

4FFI2Z34ABFE 

d. Status Constant 

syntax: | 
<STATUS CONSTANTS: :='<LDENTIFIER>D! 

Example; 

PHO qa 

PCOLDe 

"ALERT! 

we otL log econstalt 
syntax: 
BoLRING CONSTANTS: s="< ALPHABET SYMBOLS" 
pi<STRENG CONSTANEO<ELPHABET SYMNBOLD" 
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<ALPBABET SYMBOLS: :=<LETTILR> 
{<DECIMAL DIGIT> 
I<SPECTAL SYMBOL> 
<SPECIAL SYMBOLD: s=t+{-[/i*t-i) EL CiSt, ' [stace 
I<E>dHtrmbetlis tl 

Semantics: 

There is no imposed limit cn the number cf characters 
allcwed in a string constant. aA double quote ("""), however, 
is nct allowed. 

Example: 

“STRING CONSTANT" 


LI. Boolean Constants 


> yn tak: 
<BOOLEAN CONSTANTS: s=TRUE 
| FALSE 
Semantics: 


The internal value of TRUE 1s one (1) and FALSE is 

zero (Q). 
ge. Comments 

syntax: 

<COMMENT>::=Z<ALPHABET SYMBOL> Y% 
{(7XCOMMENT><ALPHABET SYMBOL>Z, 

Semantics: 

The comment statement may be used to document a pro- 
gram. It may be inserted anywhere within the frecqram or its 
statements but there must be at least one blank inserted if 
it follows an identifier. 

Example: 

SET: THIS IS AN ASSIGNMENT STATEMENT A TC BS 
3. Data Structure 

a. Data Declarations 
Syntax: 
Sorolkt DATA DECLARATION>:3:=SYS_DD <ICDENTIFIERSS 
§<DATA ELEMENT>¢ ENDS 
<LOCAL DATA DECLARATIONDS::=LOC_DD <IDENTILFIERD? 
E<DATA ELFMENT>2 ENDS 
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SAuiC LATA DECLARATION>D: s=AUTO_ DD <IDENTIFIERSS 
§<DATA ELEMENT>£ ENTS 
<DATA ELEMENTS: :=<VARIABLE DECLARATION>D 
[<TABLE CECLARATION>D 
I<FORMAT DECLARATIONS 
(<TAG DECLARATION> 

Semantics: 

A data declaration defines the data identifiers and 
their attributes which apply to the dynamic statements that 
folicw. There iS no imposed limit on the number cf data 
elements ailowed within a data declaration or on the nun- 
ber of data declarations ailowed within a Systen. System 
data declarations apply to the entire compile-time System 
and Local or Auto data declarations arrly only tc the Systen 
Procedure in which they are defined. 

b. Variable Declarations 

syntax: 
<VARIABLE DECLARATICN>::=VREL <DATA TYEFE> <ITEM> §$ 
<DATA TYPE>: :=INTEGER(<DECIMAL INTEGERD) 

{FIXED (<DECIMAL ZINTEGER>,<DECIMAL INTEGERD>) 
{FLOAT (<DECINAL INTEGER>) 
[EITS (<DECIMAL INTEGERD) 
ICHAR (<DECIMAL INTEGERD) 
[STATUS (§<STATUS CONSTANT>) 
| BOOLEAN 
SIT EMD: s=<ITDENTIZIER> 
[<IDENTIFIERS>=<CONSTANTSD 
<CONSTANTS: s=<REAL CONSTANTS 
1<BITS CONSTANT> 
1<CHARACTER CONSTANT) 
{<STATUS CONSTANTS 
| <EQOLEAN CONSTANT> 
{<TAG> 
<TAGS33=<IDENTIFIER>D 
Semantics: 
The variable is a one-dimensicnal data structure used 


meectOLe* data values for each data type. There Ws no in- 
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posed limit on the number of Items allcwed in a variable dec- 
laraticn. The farenthesized integers in the data type speci- 
fication determine the length of the Item. For types Inte- 
ger, Fixed, Float and Bits the length is in binary digits. 
The actual length implemented, however, may vary cepending 
upon the arithmetic and addressing characteristics of the 
target nachine. fhere is no imposed limit on the number 

of character symbois allowed in a character variatle. 

Status variables do not have length in the usual 
sense hut have a capacity based upon the number of status 
constants assigned them up to a maximum of 127. 

Examples: 

VREL INTEGER (15) A,B1,C=123765, D3$ 

VRBL CHAR (34)C1="THIS IS A STRING?" 

VERBS STATUS (‘COLD | wHeEt! (| "EOT') HEBTHERS 

VRBL ECOLEAN ONT ARGET=FALSE, HOSTILE=TRUES 

¢e Gag Declaration 

Syntax: 
<TAG DECLARATION>::=TAG 4 <IDENTIFIER>=<REAL CONSTANI>£ $ 

Semantics: 

A tag is a name for a Real constant. Once de~ 
clared, tags may be used in data declarations and dynamic 
statements, where the appropriate constant value will be sub- 
Stituted. There is no imposed limit on the numker of identi- 
fiers defined in a tag declaration. 

Example: | 

TAG A=3.5, E=1,C=3.2E~105 

d. Format Declaraticn 

Syntax; 
<FORMAT DECLARATIONS: 3: =FORMAT <IDENTIFIER> (<CAKRIAGE 

CORT RCL> ik DESCRIFTOR GROUP>¢) $ 

Symbcl: Definiticn: 

<CARRIAGE CONTROL>::=Space Single space and print line. 
{0 doukle space and frint line. 
|- triple space and print line. 
{1 page eject and frint line. 


[H Same as 1 and cancel header. 
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<DESCRIPTOR GROUP>:: =<DESCrIPTOR>D 


fm @ <LESCRIPTORAS) 


Symbol: Definition: 


<DESERILETORD: :=Iw.d 


| Fw.d 


[Ew.d 


|Ow.d 


Fixed-point binary to fixed-point 
decimal character string. 
Floating-point rinary tc fixed~pcint 
decimal character string. 
Floating-point binary tc floating 
point decimal character string, 
Fixed~point binary to fixed-point 
octal character string. 

Binary digits to hexadecimal 


Character string. 


I" STRING"Tnteger coded characters to 


[Aw 


{iw 


| wx 


{two 


2/2 


character symbol strings. 

First w symbols of an alphanumeric 
data unit are transferred as charac- 
ters. 

Last w symbols cf an alphanumeric 
data unit are transferred as charac~ 
ters. 

Skip w characters of an inrut record 
or Space w characters in an cutput 
£CCOEU, 

A position designator of tab for 
buffer at character positicn w. 

End o£ record, or number of records 
to be skipped. N+1 slashes cn ingut 
causes n records to be skipped. 

N+1 slashes on cutput causes n Diank 


records to be produced. 


{n<DESCRIPTOR>D 


Semantics: 


The format declaration specifies to the ccmpiler the 


fesoirted ccaversion or data elementls between internal and 


external forms. A format identifier is referenced by INEUT, 


morerur?, ENCODE and LECODE Statements tc descrite data ccn- 
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versicn reguirements. fin the format descriptor, w# 1S an 
unsigned integer representing the maxinua number of characters 
of a fieid in the external medium. Integer w includes 

the space for signs, radix points and exponent descriptions, 
but is limited in size to the width of the output medium line. 
The unSigned integer d represents the number of characters 
that appear to the right of the radix fpecint in the cutput 
medium, hence d must be less than w. The unSigned integer 

nh specifies the number of repetiticns cf a descriftcr to be 
applied to consecutive output fields on a line. the unsigned 
integer m specites the number of repetitions of a group 

of desciiptors to be applied to consecutive output fields on 

a line. A space must follow a format data type symbol such 
as I, E, and H. 

Examples: ! 

FORMAT F100(" ",3E 10,F 14.7,4(1 6,¥ 6.2))$ | 

FORMAT F200("1", "STATUS: ",A bp //gMACTION:", A 9)$ 

€. Table Declaration 

Syntax: 
<TAELE LECLARATION>S 3 :=<TABLE HEADD3<TAELE ELEMENT2$END_TABLE 5 
<TABLE KEAD>: :=TABLE §<IDENTIFIERDS {<DIMENSIOND>?) $ 

[INDIRECT TABLE {<IDENTIFIERR( \<DINZNSION>2) $ 
<DIMENSTCND> 23 =<CCNSTANT> : 
<TABLE ELEMENI>::=FIELD <DATA TYPE>Q<FIELD ITER 

“JITEM AREA <TDENTI£ TERS 
; [SUB_TAELE <IDENTIFIER> ({<DINENSIOND} ) $ 
SeLELD ITEM>: s=< LTEM> 
| | <IDENTIFIERD>= (2 <PRESETD¢) 

| <IDENTIFIZR>=m ({<PRESETDZ) 
<PRESETS 3 3 =<CONSTANTD 

Semantics: 

The Table declaration creates a multi-word data struc- 
ture whose basic element is the Item. All Items in a Table 
contain the same number cf words and Fields. An Item-area 
is a working area with a dimension of one, and the same Field 
Characteristics of the parent Table. A Sub-table is a subset 


of contiguous Itens in a Table, and thus is not a separate 
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data structure. The unsigned integer m indicates that n 
consecutive Fields are to be preset tc the Same value. 

There iS no inposed limit of the number of Table 
declaration blocks. The maximun number of Tables allowed in 
a Table declaration block is twenty (20). The gaximum number 
of dimensions allowed in a Table is 128. The greatest value 
a Single dimension can have is 65,536. The maxinunm number 
of Fields ailowed in a Table Item is ferty (40). There is 
no imposed limit on the number of Item-areas and Sub-tables 
that can be declared within a Table block. 

-To reference an Item within a Table, the Table name 
followed by the subscript list must be specified. Tc ref- 
erence a Field within a Table Iten, the Field name follow- 
ed by the Taole name and subscript list must be specified. 
To reference a Field or Item in a Sub-table is the Same as 
for Tables. The dimensicns must be within the defined 
subtable poundaries or an error will cccur. Tc reference an 
Item-area, the Item-area name in parentheses must be 
specified. To reference a Field within an Item-area the 
Field name followed by the Item-area name in parentheses 
must be specified. 

Example; 

A Diet wi lee, Ce 10,200,909) 5 

FIELD INTEGER (16) TF1=1000 (-1) §$ 

FIELD BOOLEAN TF2$ 

febE LD STATUS( * DOWN * PP READY ©) *ALTRBORNE *) ATRCRAFITS 
SUBTABLE ST1(500,1,1,1)$__ 

ITEM~AREA ITM1S 

END_TABLE$ 
4. Control Structures 

ae VARY Statement 

syntax: 
<VARY STATEMENT>22=VARY <REAL VARIABLED FROM <REAL ERIMARY>D 

THRU <REAL PRIMARY> DO <STATEMENTOS 
JVARY <REAL VARIABLE> FROM <REAL PRIMARYD 
OTEP <REAL PRIMARY>D THRU <BEAL PRIMAKY>D 
DO <STATEMENTS$ 
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Semantics; 

The VARY statement allows multiple loof indicies on 
the same level and mnultirle nesting of loops, with the upper 
limit on the numper of indicies a function of target 
machine characteristics. Indicies must be declared in data 
declaraticns and retain their last value when a VARY loop 
is exited. The initial, step and test values are evaluated 
chnly once when the loop is entered but the loop index variable 
may ke changed during execution. 

The statement or statement block within a VARY loop 
must be-executed once and the index may é€ither Le incremented 
cr decremented until it reaches the test value. Kkhen all 
indicies Qn the same level have reached their test values 
the loop will be exited. A partial pass may be made through 
a locp ty a RESUME statement. This is accomplished by a 
transfer of control to the increment and test ster at the 
end cf the loop. | 

The initial, step, and test value data Elements must 
appear in the order listed, but the step value may te omitted 
in which case an implied increment of one is assured. The 
Real data type for index variables and loop contral values 
is always integer, therefore, fixed-point or flcating-fpcint 
values are not allowed. 

Example: 

VARY A FROM B STEP -1 THRU C DC 

BEGIN , 
SET:D TO E+1 $ 
VARY D FROM F THRU H,E FROM L THRU 2C DO 
BEGIN 
SET3:G TO D-10 3 
SETsM TO EtS es 
ENDS 
ENDS 
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b. CASE Statement 
Syntax; 
<CASE STATEMENTS 33=CASES<REAL DATA UNIT> OF <STATEMENTOS 
ELSE <STATENENT>$ 
[CASE:<REAL DATA UNIT> or § {<REAL CONSTANT>: 3 
<STATENENT>$&2 ELSE <STATEMENT>$ 
{CASE:<STATUS DATA UNIT> OF 3 &<STATUS 
CONSTANT> 3XSTATEMENT>$@ ELSE 
<STATEMENTSS 
Semantics: 
The CASE statement allows selection of a statement 
within a list of statements for processing depending upon 
the value of a data unit. This can pe accomplished in twuc 
ways. the index case allows declaration of a list of n 
Statements with an implied numbering cf 0 to n-1. If the 
value of the data unit is from 0 to n-1 then the appropriate 
statement is executed, or the ELSE sase statement is executed. 
The label case method requires that each label kre checked for 
amatch wath the cata unit. If a match is found then the 
appropriate statement is executed, or the ELSE case statement 


1S executed. 


Example: 
(1). Index Case ’ 
CASESD OF 


CALL PROC1$ 
CALL PROC2$ 
_ ELSE CALL PROC3$ 
{2). Label Case 
CASEFZA OF 
woes SEL: #.70°CS 
Sis 7 sl EMASE LTEENIGOTO Ds 
EDGE SET CASE TO TF ALSES 
c. While Statement 
Syntax: 
SWHELE STATEMENTS: s=WHILE <ECCLEAN EXERESSION> [C <STATEMENTOS 
Senantics: 


The WHILE statement allows a statement cr statement 
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Elock tc be executed ina loop until the value of the Boolean 
expression 1S FALSE. The Boolé€ah expression is evaluated 
each time the loop is executed and if it 1S initially FALSE 
the locgp will not be executed. The Bcocléean expression must 
be made FALSE during loop execution or the loop will execute 
infinitely. 

Examples: 

WHILE A<5 DO 

BEGIN 
Sta2p PO C+is 
SET:A TO A+175 
ENDS 
d. iF Statement 
SYD tax 
<IF STATEMENTS: :=LF <BOCLEAN EXPRESSICN> THEN <STATEMENTDS 
{If <ECCLEAN EXPRESSIONR2 THEN <SIMPLE 
STATEMENT>S ELSE <STATEMENTO$ 
(IF <BOCLEAN EXPRESSION> THEN{ <STATEMENT>S 
WlF <BOOLEAN EXPRESSICN> THEN2 <SIMELE 
STATEMENT>$ SELSE <SIMPLE STATEMENT>? 
ELSE <STATEMENTD$ 

Semantics: 

The If statement allows nesting of IF THEN ELSE 
statements with no imposed limit. Only simple (not CASE, 
VARY, WHILE, RESERVE, Or IF) statements are allowed within 
the nesting structure. Pasic and Sinfle statements, however, 
may ke used icr the last statement after thes last ELSE: 

The ELSE statement is always matched with the nearest IF 
THEN phrase, thus eliminating the dangling ELSE froblen. 

Examples: 

IF A=E THEN GOTO LABEL1$ | 

IF A=B THEN IF C=D THEN SET:A TC D$ 

LERA=DelLHEN Ir C—D THENS SET: A WNC D ELSE SETISB TO C$ 

e. RESERVE and WAIT Statements 

Syn Lax: 

<RESERVE STATEMENT>:: =RESERVE( 9<RESOURCE>?\<SIMELE 
STATEMENTO$ 
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<RESOURCEDS3 3 =<PROCEDUKG STATEMENTS 
{:<DATA UNIT> 
{<DATA STRUCTURED 
<DAIA STRUCTURED: :=<I ABLE IDENTIFIER> (§<REAL PRIMARY>2) 
(<FILE NAME>(<REAL ERIMARY>) 
<TAELE IDENTIFIER>: s=<TABLE NAMED 
(<SU.E=TABILE® NAME> 
{<I TEM-ARKEA NAMED 
<WAIT STATEMENT >:3=WAIG ({<PROCEDURE NAMED? )$ 

Semantics: 

The RESERVE statement may be used to infcram the 
monitor system of an impending entry into a critical section. 
The procedure statements, data units cr data structures 
thus identified will not then be change-accessakle by any 
other processes running in the same multi-progranming or 
multi-processing environment. Upon leaving the RESERVE 
biock, the resources will be freed for change-access by 
cther processes. The WAIT statement has heen added to allow 
synchronization of processes, that is, to specify that the 
ccontrclling frocedure is to suspend execution until one or 


more procedures have completed their execution. 
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Example; 
COBEGIN 
BEGIN 


RESERVE (R1) 
s+ BEGIN 


END$ 
ENDS 


BEGIN 


RESERVE (R1) 


BEGIN 
SA 
END$ 
END$ 
BEGIN 
33 
END$ 
COBEGIN 
BEGIN 


WALT (S2,S3,S6) $ 


END$ 
EEGIN 
WAIT(S1)$ 


END$ 
BEGIN 
WAII(S1)$ 





— ENDS 
CCENDS 
COEND¢ 
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£. SET Statement 
Syntax: . 
<SET STATEMENT>::=SET: <LDATA UNIT> TO <EXPRESSIONAS 
<DATA UNIT>::=<REAL DATA UNIT> 
J<BITS DATA UNITS 
{<ECOLEFAN DATA UNIT) 
(<CHARACTER DATA UNIT> 
<STATUS DATA UNIT>D 
(<TABLE IDENTIFIERD 
pRUALLL GLLNGZILLLAD( <RLEAL FRIMNARY>D ) 
Beefy PATA UNIT. 22=<REAT VAPIEBLED 
I<KREAL FIELD NAME> <TAELE IDENTIFIER> 
( <REAL PRIMARY> ) 
<REAL VARIABLES: :=<INTEGER VARIABLE> 
(<FIXED-POINT VARTABLED 
{<FLOATING-POINT VARIABLED 
<BITS DATA UNIT>::=<BITS VARIABLES 
(<BITS FIELD NAME> <TAELE IDENTIFIER> 
( <REAL PRIMARY>D ) 
<EOOLEAN DATA UNITS>::=<BCOLEAN VARLABLE>D 
|<ECOLEAN FIELD NAFED> <TAELE ILDENTIFIERD 
( <REAL PRIMARY> ) 
<STATUS DATA UNIT>:3=<STATUS VARIABLES 
{<STATUS FIELD NAME> <TABLE ILDENTIFIELR> 
( <REAL PRIMARY> ) 
<CHAKRACTER DATA UNITS: :=<CHARACTER VARIABLE>D 
{<CHARACTER FIELD NAHED <TABLE 
TIDENTIFIER>( <REAL EnIMARYD ) 
<EXFERESSION> 3 s=<REAL EXPRESSION>D 
{<BITS EXPRESSION> 
{<CHARACTER EXPRESSICN>D 
{<STATUS EXPRESSIOND 
{|<BOOLEAN EXPRESSIOND 
Pol aePrLPe DIENT TF GER> 
Semantics: 
The SET statement allows the assignment cf evaluated 


€xXEressions to a compatible data unit cr Table structure. 
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Ali assignments must be tyre compatible and receftacle data 
unit lengths will determine whether the results are truncated 
at the least significant digits or right justified and 
filied in with zeroes. For Character assignments, the 
character symbols on the right of an expression will be trun- 
cated if the receptacle is smaller than the expression, 
otherwise, the result will be left justified and the remain- 
ing positions unchanged. 

The Status aSsignment will transfer to the Status 
Variable the integer value cf the Status constant on the 
meght. 

Bits expression assignments fcllow these rules: 

(1). If the length of the expressicn is greater 
than the data unit, then the result is right-justified in the 
data unit and truncated at the excess bits on the left. 

(2j- en the length of the experessicn as less 
than the data unit, then the result is right justified in 
the data unit and the unused bits are set to zero. 

A multi-word Table-to-Tabie, Item-to-Item or 
Single Item-to-Table assignment resuits in the trans- 
fer of all values from one element tc ancther. Table-to- 
Table assignzent implies that every word of one Table will 
be transferred to every corresponding werd of ancther Table. 
Dimersicn compatibility is the responsibility of the pro- 
grammer. The total number of words in each, however, must 
be the same. Item-to-Item assignments will alsc result in 
the transfer or ail words in one Item tc another Iten. 
Assignment length compatibility is the responsikility of 
the ~rogrammer and excess words will be truncated. iten-to- 
fable assignment implies that every Takle Item is replaced 
ry the item value. 

Exanples: 

Sie): pee” Eb 

PIV W et WOMLES wis 

SbeW kAt ey eo © HOT SS 

SET WEATHER, TEMEEBATURE TO 'HOT'$ 

SEIZ:A TO (B+tC) *D$ 
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SET: (A) ..5 TO C$ 
Shi aAmee Geeeurs « 20 
SEIZsABEO © REN GDS 
SET:A TO E ANDL C$ 
Sh LS AmLOer ALS? 
SET:C TO "CHARACTER STRING 11"9 
Sol Sa tie, Piece b (1,2) 5 
Stels A Dice 2 Om LA bao 2 2 
Sh sieve LOS LTEM iS 
QO. SWAP Statement 
Syntax; 
<SWAP STATEMENTS: 3:=SWAP:<DaATA UNIT> FCR:<DATA UNITS 
Semantics: 
The SWAP statement exchanges the values cf two data 
units. Replacement rules are the same as those fcr the 
obi Statement. 
Example: 
SWAP:C(1,2) FOR:D(2,1)$ 
h. SEARCH Statement 
syntax: 
<SEARCH STATEMENT>: s=SEARCH:<DATA STRUCTURED EFCER:3<DATA UNITS 
Semantics: 
The SEARCH statement provides the capakility to search 
a Table cy File for an entry with same value as a data unit. 
If found, the index variekles in the data structure's dimen- 
sion list will ke set at rur-time to point to the value's 
location, otherwise they will be set tc minus one. Both 
Structures will be searched sequentially starting with the 
first dimension and varying it and adjacent dizensicns uf- 
ward frcm right to left to their maximum limit until the 
Value is found cr nct found. 
Example: 
SEARCH: TABLE1(D1,D2) FOR:B(1, 1,1) $ 
1. INPUT Statement 
S yined As 


99 





<INEUL STATEMENT R: s=INPUT <FILE NAMED INTOS<DATA UNITS 
{INPUT <FILE NAME> <EFORHMAT NAME> 
INTO: <DATA UNITS 
<FILE N2AMED:3 2=READ 
{OCM 
j<USER FILE NAME> 

Semantics; 

The INEUL statement causes a mcnitcr roytine to infut 
data e€lements from a specified device or user declared file 
into a data unit or units. When the data unit is a whole 
Table the input fills the Table sequentially, word by word, 
If a format name iS not referenced, the input character 
string is converted to the type and length of the receptacle. 
The user file name feature was not implemented in this 
version of CMS-2kS, but wiil allow data to be mcved to and 
from internal and external devices. 

Examples: 

INPUT READ INTO:4,8,C85 

INPUT GSERK1 FORM1 INTO: TABLES 

j- CUTPUT Statement 
Syntax; 
<CUIPUT STATEMENTS: :=OUTPUT <FILE NAME> <FORMAT NAMEDS 
{OUTFULT <FILE NAMED CUTOF:<DATA UNITS$ 
(OUTEUL <rita NANEDY <FORMAT KANED OUTOF: 
<DATA UNITO$S - 
<FILE NAMr>33=PRINT 
{ PNCH 
{ OCM 
(IS USER $401 LE NAMED 

Semantics: 

The OUTPUT statement causes a mcnitor rcutine to cut- 
put data cr a character string to a specified device or user 
declared file from a data unit or units. When the data unit 
1s a whole Takle the OUTPUT statement empties the Table Iteas 
segquentially,word by werd. If a format name is not refer- 
enced, the output character string is truncated at 22 positions 


Per data unit. 
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Examples: 

CUTEUT PRINT "THIS IS AN OUTPUT STRING" 

OUTPUT OCM OUTOF2A,B,CS . 

CULPUT USER2 FORM2 OUTOF: TABLEU$ 

k. ENCODE and DECODE Statements 

syntax: 

<ENCCDE STATEMENTS: :s=ENCCDEsS<CHARACTER CATA UNITS <FORMAT 
NAME> OUTOF:<LTATA UNITO$ 

<CTECODE STATEMENTS: s=DECCDE:<CHARACTER DATA UNITS <FORMAT 
NAME> INTO:<LCATA UNITS 

Semantics: 

The ENCODE statement specifies that the scurce data 
unit(s) are to be ccnverted to character strings and packed 
seguentially into the receptacle according to a specified 
format. The DECODE statement is the reverse of the ENCODE 
Statement. 

Examples: 

ENCODE; CHARAC1T CUTOF:A,8,C 

DECODE:CHARAC1 INITO:A,B,CS 

1d. PACK and UNPCK Statements 

S¥yntax: 
<PACK STATEMENTS 32=PACK:<LTATA UNIT> OUTOF:<DATA UNITS 
<UNPCK STATEMENT>::=UNPCK:<DATA UNIT> INTO:<DATA UNIT>$ 


Semantics: 


te 


The PACK statement transfers a kit string froma 
source list of data units into a receftacle data unit. The} 
are stored consecutively from left-to-right and without 
Spacing. The receptacle and source data units may be var- 
jable to Field, Fieid to variable, varietle tc variable, 
Field tc Field, Table tc Table, Table tc variable, 

Variable to Table, Field to Table, Table to Field, 

Item to Item, Table to Item, Item to Tattle, Item to Field 
and Field to Item. ASSignmtent length ccmpatibility is the 
respcnsibility of the programmer and excess words will 


be truncated. 
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Examples: 

PLeOk+s thet cee OULTOF STAERLEZS 

PACK:A OUTOF:B,C,D% 

m. Erocedure Statement 

syntax: 

<PRCCELURE STATEMENTD :;=CALL <PROCEDURE NAME> 
<INPUT PARAMETERS>D 
<CUTPUT PARAMETERS D 
<LABEL PARANETERS>) $ 
[CALL <PROCEDURE NAME> 
<INEUT EARKAMETERS>::=( , 
| <EXPRESSION> ) 
<OUTEFUT FPARAMETERSOZ2=[|] , 
{ji <DATA UNIT> 
<LABEL FARAHETERS> 2 2=| | <LADEL NAME> 

Semantics: 

The procedure CALL statement transfers control tc 
a’naned frocedure and maps any actval parameters to the 
corresponding formal parameters. | 

All procedure input and output parameters must have 
been previously declared. Parameters are normally mapped by 
passing values, thus resulting ina cali by value. If the 
Same data element is used for output in both actual and 
formal farameters, then the effect is call by result. If 
an actual parameter 1s omitted, or if the same data element 
is specified as both the actual and fernal parareter then 
a c@all ty address occurs. If call by address 1s desired 
and the data elements are not the same, then the CCKAD 
cperator must be used to pass the address of variables or 
Subscripted variables. For Tables, the indirect declaration 
allows the Takkle address to Le passed tc the called procedure. 
Ey waxing any of the above conventions, calls by value- 
result cr address-result may occur. 

Examples: 

(1). Value 

TABLE 11,T 25 
CALL ALT2) We 


ee 





PROCELURE A(T1{ )$ 
(2) .° RESULT 

PAELE T1,T28- 

CALL A( |[T2)¢4 

EROCEDURE A ( {11)5 
(2). Value-result 

VRBL &,B,CS 

CALL DAMN & 

PROCEDURE D({BIB)$ 
(4). Address (Tables) 

LaAbiieus , 0 3o 

DNUIREG TAA Ie ee. inp 

Ome he (TD ELSES 

PROCEDURE A(T2|13)$ 


CALL A(T2{t )$ 
CALIE E(T 34 ) 5 
PROCEDURE A(T1] )$ 
(5). Address-result 
TABLE Tt,T2$ 
CALL A{T1]T2)$ 
EFROCEDURE A(T1]121) $ 
(6). Address (variables) 
Vinviog Ee bes 
CALL D(CORAL(A) } )$ 
CALL D(CORAD(E) | )$ 
_ PROCEDURE D(C{ )$ 
ne. GOTO Statement 
Syntax: 
<GCTC STATEMENT>::=GOTO <LABEL NAMED$ 
Semantics: 
The GCTO statement transfers ccntrol to a labeled 
Statement or statement bicck. 
Examples; 
GOTO LABEL1$ 
IF A=B THEN GOTO LABELZ$ 
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O- KETURNTO Statement 

Syntax: 
<RETURNIO STATEMENTS: :=RETURNTO <LABEL NANED>$S 

Semantics: 

The RETURNTO statement transfers contrcel kack te 
one cf the abnormal exit label parameters in a procedure 
call. 

Example: 

RETURNTO EXITLBL1$ 

Oem RPSUME Staak enent 

syntax: 
<RESUME STATEMENTS: >=RESUME <LABEL NANE>$ 

Semantics: 

The RESUME statement specifies a transfer to the 
increment and test step within a VARY Llock. 

Example; 

IF A=E THEN RESUME VARY 1$ 

pe RETURN Seatement 

Synta 

<RETURN STATEMENTS: s=RETURN 
{RETURN:<DATA UNITS 


Xs 


Semantics: 

The RETURN statement transfers control fron within 
a sub-routine to the point of call. The RETURN statenent 
may be cnitted if it is the last statement in a ;rocedure. 
To return from a Functicn, the RETURN data unit statement 
must be used. 

Example; 

KETURNS 

IF A=E THEN RETURNS 

IF A<B THEN RETURN:AS 

re. osoTOP Statement , 

Syntax: 
BOLOP STATEMENT>;: ; =STOP 

Semantics: 

The SIOP statement temporarily suspends program exe- 


cuticn until an operator manually restarts the computer. 
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Fxample; | 
IF INTERRUPT (1) THEN STOP$ 
5. Expressions 
ae Real Expressions 
Dy ata 
<REAL EXERESSIONS:3:=<REAL PRINARY> 


{<REAL EXPRESSION> <REAL OPEFATCH> 


<KEAL PRIMARY> 
| Symboi: Definition: 
<REAL OCEERATOR>D: ;=+ addition 


j- subtraction 

| * multiplicaticn 
lf division 

{REM remainder 

| ** exponentiation 
{ABS absolute value 


i- unary minus 
<REAL EFIMARY>::=<REAL DATA UNITS 

|<REAL CONSTANT> 

1(<REAL EXPRESSIOND) 


|<REAL FUNCTION NAME> <INPUT PARAMNETERSD) 
1 (<REAL EXPRESSTION>)..<REAL CONSTANT> 


[NUMBER (<BITS PRIMARY>) 


| CHARCODE (<CHARACTER EFRINARY>) 


| COUNT (<DATA UNIT>) 
{}CORAD (<DATA UNIT>) 


Semantics: 


The Real expression allows mixed mode arithmetic 
With integer, fixed-point and floating-point operands. 
Evaluation is from left-to-right in the order of the 
pricrities listed above. The REM operatcr provides the 


remainder after integer, fixed-point or floatinc-jcint di- 


visicn cf two operands. 
Examples: 
SET:A TO B REM NUMBER (C) $ 
SET: D TO —B¥*C*¥*E*CORAD (F)$ 
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b. Bits Expressions 
Syntax; 
<BITS EXERESSION>::=<BITS PHRIMARY> 
{<BIIS EXPRESSION> <LCGICAL CEERATOR> 
<EITS PRIMARY> 
{<BITS EXPRESSION> <SHIFT OEFEKATCK> 
<REAL PRIMARY> 


Symbcl: Definition: Priority: 
<LOGICAL GPERATOR>: 3=NOTL logical not Tt 
{ ANCL logical and 3 
{ORL logical or 3 
Son LFT CPERATCRD>::=SHLL shitt left logical 2 
{SHRL shift right logical 2 
{CIRS HLL Circular SHLL 2 
{CIRSHRL Cipetia Le ok 2 


<BITS FRIMARYS::=<BITS CONSTANTS 
1<BITS DATA UNIT> 
| (<BITS EXYPRESSIOND) 
(<BILTS FUNCTION WNANE> <INEFUILSPARAKETERS >) 
JBITSTRING<INPUT PAKAMETERSD) 

Semantics: 

The Bits expressions allow logical operations on 
binary cperands cf equal cr unegual length. The shift 
Operators aliow shiiting of a Bits operand an integral 
amount either end-off OF within the operand. All end-off 
shifts result in vacated bit positions Eeing assigned the 
value cf zero (0). The BITSTRING Function has farameters: 
(real primary, real primary,data unit). The first faran- 
eter 1S the starting bit fosition in the third farameter. 
The seccnd parameter 1S the number of bits to ke extracted 
from the third parameter. The result is a substring of 
bits free the data unit. 

Examples: 

Skis toes ANDi GC SHL lL 35 

ois D BO BITSTRING (4, 1058) 5 
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c. Character Expressions 

syntax: 

<CHARACTER EXPRESSION>:32=<CHARACTER PRIMAEY>D 
|<CHAKRACTER EXERESSIOND CAT 
<CHARACTERK PRIMARY> 

<CHARACTER PRIMARY>33:=<CHARACTER CONSTANT>D 

{<CHARACTER DATA UNIT> 

i (<CHARACTER EXPRESSIOND) 

[<CHARACTER FUNCTION NAMED<INPUT 

FARAMETERS>) 

{COCLECHAR(<REAL PRIMARY>) 

(SUBCHAR<INPUT FPARAMETERS>)) 

Semantics: 

The Character expression allows concatenation of 
succesSive character strings. The SUBCHAR functicn has 
paragters: (real primary, real primary, character primary). 
The first parameter is the starting character fcsition in 
the third parameter. The seccnad parameter 1S the number of 
Characters to be extracted from the third patameéeter. The 
result is a substring of character symbols from the character 
primary. 

€. Eoolean Expression 

Syntax: 
<EOCLEAN EXPRESSION>: s=<ECOLEAN PRIMARY? 

[<EOCLEAN EXPRESSIOND>D <ECOLEAN 
OPERATOR> <ECOLEAN EFIMARY> 


Symbol: Definiticn: Eriority: 
<EBOGLIEAN CPERATOR>D: :=NOT complementation Zz 

| AND conjunction 3 

LOR disjunction 3 


<BOOLEAN PRIMARY>::=<BOOLEAN CONSTANT> 
{<BOOLEAN DATA UNIT> 
1 (<EOOLEAN EXPRESSION>) 
|<BOOLEAN FUNCTION NAMED><INEUT 
FARAMETERS>) 
1<RELATICNAL EXPRESSION> 
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<RELATICNAL EXPRESSIONS: :=<REAL PRIMARYD> <RELATICNAL OPERATOL> 


<REAL PRINARY> 
|<ELIS PRIMARY> <RELATICKAL 
OPERATOR> <BITS PRIMARY> 
|<CHARACTER ERIMARY> <RELATIONAL 
OPERATOR> <CHARACTIER PRIAMRY> 
}<STATUS EXPRESSION> <RELATIONAL 
OPERATOR> <STATUS EXPRESSION> 


Symbol: Definition; PEVOEIt y4 
<REILATICNAL OFERATOR>D: := less than 1 

|> greater than 1 

| = equal 1 

|-= not equal 1 

{<= less than or equal 1 

| >= greater than cr equal 1 


Semantics: 


The Boolean expressions allow logical cc@rEarison cf 


two cr mcre operands for TRUE of FALSE conditions using 


Boolean operators. The relational expressions allow 


compariscn of Real, Bits, Character or Status expressions. 


d. 


<STATUS 


stants, 


Example: 
IF A=B AND C=D THEN SET:A TO D$ 
Status Expressions 
Syntax: 
EXPRESSION>::=<STATUS CONSTANT> 
|<STATUS DATA UNIT> 
J<STATUS FUNCTION NAMED><INEUT EARAMETERS>)) 
Semantics: 
The Status expression allows the use of Status con- 
data units and functions in dynamic statements. 
Examples: 
SETSWEATHER TO ‘HOT! $ 
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APPENDIX, C 


SAMPLE CMS-2RS FRCGRAWS 


SAMPLES OF DATA 


SYSTEM ALPHA ¢ 
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EXT Robe 8S 


A2 $ 


Al; 


NTEGER(16) 
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DURE P2 $ 


ION FIXEC( 32,16) FACT( INS, IN4) $ 
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PROG & 2 


ENC $ 4 END CF SYS 
END $9. END OF SYSTEM 


ALPHA & 


SYSTEM BISEARCH $ 
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PROCE 
BEGIN 
lia! 
BUS E 
ELS 
END $ 
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D 
% 
END $ 2 END CF SYS 


PROC SRCF &@ 


ENO $ @ END OF BISEARCH @ 





APPENDIX D 
DESCRIPTORS! PROTOCOL 


Due tc the variety of conditions that need be described, 
it beccmes necessary to recognize several descriptor 
formats. These formats are detailed here to facilitate 
future work in compieting this compiler. Unless ctherwise 
specified, the header byte in every descriptor gives the 
number CL bytes that follow. Fields are described 


sequentially as they appear in the descriptor. 


1. Parareterless Procedure 


A descriptor for a parameterless procedure consist of 
the heacer kEyte followed by a four-byte field to store the 


entry pfcint address of the procedure. 


2. Procedure kith Parameters 


2 a 


A descriptor for a procedure with parameters consist of 
the header byte, a four-byte field to store the entry point 
address cf the procedure, one byte to indicate the type and 
number cf farameters, and a number of two-kyte fields, one 
for each farameter, to store fpointers to the parameters' 
entries in the identifier directory. 

The twe high order bits of the type-number byte indicate 
the .type of parameters that follow according tc the 
following ccde: 00 fcr input paraneters) O1 £01 output 
parameters, and 10 for label parameters. The six lcw order 
bits remaining indicate number of parameters (64 maximum). 

Diceeaiey ic auUnCC ENN DY tCmmGogcinictmw ith thiempointer fields 
fcrg a grcup. Up to three such groupS can appear in the 
descriptor, o9ne for each of the three types of parameters, 
INFECT, CUIPUI, and LABEL. 


113 





3. Function Name 


ne te eg ey 


A functicn name descriptor consist of the header tryte 
and a fcur~kyte field to stcre the entry feint address of 
the procedure. The remainder of the descriptor varies in 
format among the five possipnple types of functions as 
fellecws: 

a. Numeric Function 

The entry point field is followed by a twe-kbyte 
field for data type description. The twe high order bits of 
the first byte indicate the type as follows: 

QC for integer 
01 for fixed 
nor LOrmiiCac. 

The next 14 bits of the field are used tc. store 
value of the data size parameter in data type declaraticns. 
In the case cf type FIXED where two such parameters are 
used, the seven high order fits are used for the first 
parameter and the seven low crder bits for the second. 

The data type field is followed by a one-byte field 
to indicate the number of formal parameters, and a number of 
two~byte fields, one per formal Parameter, to store [fcinters 
to the parameters! entries in the identifier directory, 

ie iets Function 

Bits function descriptors are the same aS numeric 
functicn descriptecrs except that all 16 bits of the data 
type field are used to store the value of the data size 
parameter. 

Soomenaracter Function 

Character function descriptors are the same as bits 
functior descriptors. 

Gee soCatuSs €uUnceion 

Pic Son thyme 1 old m1 Smee rOllawed | Fym@ a  ene-Lyte 
field tc indicate the present value of the status function, 


1.€., the number of the status constant assigned. This 





field is followed by a number of two-byte fields, cne per 
status ccnstant, to store pointers to status constants in 
the identifier directory. A one-byte field to indicate the 
numker of formal parameters and a number of two-byte fields 
for feinters to such parameters follow. At declaration 
time, the ccnfiler associates an integer with each status 
constant in ascending order beginning with 1. This number 
is stored in the descriptor pecinter field of the status 
censtant's entry in the identifier directory. When a status 
Variatle is set to a status constant, the number assigned to 
that constant by the compiler is entered in the fresent 
Value byte of the descriptor. 
e. Bcocolean Function 

Ecolean function name descriptors need no data type 

field. Cther than that, the descriptors are the sale as 


those descriked abcve. 


Tags do nct need descriptors. Instead, the deéscriftor 
pointer field in the identifier directory has the pointer to 


the numeric value of the tag in the constant table. 


9. Variables 


Numeric, kits, and character variables have the same 
déscripter format. It consists of the header byte, two 
bytes fcr the data type field as described before, and a 
two-byte field for a pointer to the value in the constant 
table. This last field will have .an unpredictable value 
unless the yariable has been initiallized. 

Status variables have the following descriptor format: 
header byte, one byte to indicate present value cf the 
Status variaktle, 1.e., the number of the status constant 
assigned, and a number of two-byte fields, one per each 
Status constant, to store pointers to status constants in 


the identifier directory (see status function above). 


ees 





Boolean variables, like boolean functions, do not need 
descriptors. The descriptor pointer field is uSed to store 
the value of the variable; 1 for TRUE and O for FALSE. 


6. Format Declaration Descriptor 
ee ce ee Se ee re Me me ce A A a SN SS ae 


Format declaration descrirtors consist of the header 
byte, cne byte for the carriage control field, anda 
Sequence or format descriftor fields. There are five 
different types of format descriptor fields. The high crder 
three bits of each field are used aS a key to indicate the 
type of field as follows: 

001 for numeric conversion format field 
010 for character constant field 
0711 fcr slash list field 
100 for repetition grcup head field 
101 for repetition group taii fieid. 
a. Numeric Conversion Format Field 
This f1ei1a is four bytes long. The first Lyte has 
the fielc key and the type ccnversion letter. This letter 
1s represented by the low order five bits in the byte 
according to the following code: 
01001 for I : 
0€110 for 
00101 f0r 
10}1{0 for 
01000 for 
06001 for 
10011 for 
O Calg leet OF 
000171 for 


The second Eyte has the value cf the repetition 


on Eo on Eo OME SOE es 2 


Bactcer “np gnethe numeric converSion group (niw.d). It can 
have any value ketween 1 and 255. The third byte has the 
value of the "w" varameter, and the fourth byte that cf the 


"d" parameter. Tae "d" parameter can ke zero. 
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b. Character Constant Field 
The character constant field has three bytes, fhe 
first byte has the key field. The second and third bytes 
have a fointer to the character constant in the constant 
table. 
Cemmolash List Ficid 
The slash iist field has gone byte. The three high 
order kits have the field key. The five lcw order bits give 
the number of slashes in the list. 
d. Repetition Group Head Field 
The repetition group head field has two bytes. The 
three high crder bits have the field key. The remainder 
thirteen bits have the value of the group repetition factor 
De 
e. kepetition Group Tail Field 
The reépetiticn group taii field has one byte with 
the field key. All the descriptor fields bracketed ky the 
repetition group aead and tail fields wiil be repeated "mii 


times when implementing the format. 


A takle descriptor consist cf the header byte, one byte 
fer the numbér of dimensions field, a number of tweo-byte 
fields, one for each dimensicn, tc store the value of the 
dimensions, and a two-byte field to store the pointer to the 
field pointer list in the constant table. The high order 
first ‘bit cf the number of dimensions field is used to 
indicate if the table is INDIRECT or not. This bit is set 
tc "4" for INDIRECT TAELE declarations; it is reset to "Q" 
Otherwise. The remaining seven bits are used to store the 
Dumber cf dimensions (128 maximum). The field pointer list 
is a pseudo-descrifptoar consisting of a header byte and a 
numter cf two-byte fields, cne fer each field declared in 
the table declaration block, with pointers to the field name 


entries in the identifier directory. 
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Numeric, bits, and character field names have the same 
format. It consists of the header byte, twe bytes for the 
data type field, and a two-byte field for the pointer to the 
value in the constant table. If the field 1s initialilized 
at declaration time to a tJlist of values, the folicwing 
additional] fields are part of the descriptor: a two-byte 
field to store the number of values in the initiallization 
list, and a two-byte field to stcre the list repetition 
factor m. The last two pytes of every field descrifptcr have 
a pointer tc the parent table descriptor in the constant 
table. 

Status field names have the same descriptor format as 
that descrived above except for the data type field. This 
field was previously described (see status function). 

Boolean field names have the Same descriptor fcrnrat as 
those already described except for the absence of a data 


type field. 


9. Itemrareas 


Item-areas do not need descriptors. The descriftotr 
pointer field in the identifier directory has a pointer to 
euceparent table descriptor which, in turn, has a pointer to 


the field fointer list. 


10. Subtablesé 


A suktakle descriptor ccnsists of the header Eyte, a 
number cf two-byte fields, one per dimension, to stcre their 
values, and a two-byte field for a pointer to the _  farent 


takle déscriftor. 
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SE ae ge ee 


ADL 
ADC 
SUE 
2G 
MUL 
MLC 
DIV 
DVC 
REM 
LST 
EXP 
ABS 
SCL 
CNY 
NEG 
MCD 
PRY 
BLS 
BLQ 
BEC 
ENC 
. BGQ 
BGK 
ERE 
BRI 
ERU 
NCE 
STP 
LDV 
LUA 
STV 
STA 


APPENDIX E 


OPERATION CODES FOR IL 


add operand 

add constant in operand field 
Subtract operand 

subtract constant in operand field 
multiply times operand 
multiply times constant in operand field 
divide by operand 

divide by constant in operand field 
remainder 

compare 

exponentiation 

absolute value 

scale 

count ones 

unary minus 

modulo 

compare 

branch on < 

Eranch on <= 

branch on = 

branch on -= 

brach on >= 

branch on > 

branch on false 

branch on true 

unconditional branch 

no operation 

stop 

loaa value of operand 

load address of operand 

store value of operand 


store address of operand 


sD Re, 





ANL 
ICR 
XOR 
NCI 
INX 
XCH 
CAT 
INE 
IRE 
OTE 
OPE 
DEF 
LNE 
ESE 
‘LSP 
ESK 
LSR 
EPR 
LPR 
EF N 
LEN 
FRC 
ENC 
INC 
EVL 
Liv 1 
EWL 
LWL 
FCG 
ECG 
DCL 
ERB 
LRE 
EBE 
LEE 
ECB 


logical and 

inclusive logical or 
exclusive logical or 
laegical complement 
subscript computation 
exchange operands 

string concatenation 
input from reader into operand 
input from reader into operand with format 
Output to printer operand 
Qutput to printer operand with format 
define label location 
source line number 

EN@eEE SYSpPrOc 

ieave sysproc 

enter sysprocreen 

leave sysprocreen 

enter procedure 

leave procedure 

enter function 

leave function 

procedure call 

function call 

increment 

enter vary loop 

leave vary loop 

enter while locrp 

leave while loop 

enter case grour 

leave case group 

define case label 

enter reserved block 
leave reserved block 
enter segin block 

leave ktegin block 


enter cobegin block 
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LCE 
RVL 
RIL 
PCK 
UNE 
ENC 
DEC 
SRC 
SHI 
SHR 
Coe 
CSR 
INC 
[OF 
“OTN 
ONE 
OTC 
OOF 
INU 
OTU 
BIL 
LTE 
JEIGE 
oT ts 
aati 4 


“XTI 


ITi 
LTI 
XTi 
LTX 
OTD 
DTO 
XTB 
BTX 
LTE 
ETL 


leave cobegin block 

resume vary Loop 

retyrn to label 

pack into data unit 

unpack into data unit 

encede into data unit 

decode into data unit 

search for data unit 

shitt ieft amount specified in operand 
shift right aticunt specified in operand 
Circular shift left 

Circular shift right 

input from operator controlled medium 
input from OCM with format 
output to punch 

output to punch kith format 
output to Ooch 

output to OCM with format 
input from user file 

outrut to user file 

convert binary to integer 
convert integer to binary 
convert integer to character 
convert character to integer 
convert integer to fixed 
convert fixed tc integer 
convert integer to flioat 
convert float tc integer 
convert fixed to float 
convert float to fixed 
convert octal to decimal 
convert decimal to octal 
convert fixed to binary 
convert binary to fixed 
convert float to binary 


convert binary tc float 
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LEE length in binary digits 
LEY Jength in bytes 
LWL length in words 
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